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a*he invention relates to isolated nucleic acid molecules promot- 
ing expression of polypeptides in genetically modified eukaryot- 
ic host cells. 

The expression of proteinaceous substances (proteins , peptides , 
polypeptides, fragments thereof, as well as posttranslationally 
modified forms of these molecules are hereinafter referred to as 
"polypeptides" (synonymously used together with "protein", e.g. 
in the example part) in genetically modified cells is a major 
source for providing preparations of such often rare and valu- 
able sxjbstances. For expressing such polypeptides in genetically 
modified host cells, the presence of a DNA region is necessary 
which positively controls ("activates", "promotes") this expres- 
sion. Promoters are important examples for such regions allowing 
RNA polymerases to bind to the nWA for initiating transcription 
into mRNA (Watson et al., "Recombinant DNA" (1992), CSaapter i.l 
and 2) . 

Mosses have gained increasing attention as useful objects for 
research for plant physiology and development, since their 
sintple nature (mosses are situated at the base of higher-plant- 
evolution) provides insights into the coitplex biology of higher 
plants. The single morphology of mosses and the advantageous 
culturing possibilities has made them popular model organisms 
for studies of plant physiology and developmental biology: Moss 
species may be cultured without difficulty under controlled con- 
ditions, using in vitro techniques including axenic culture, not 
only in petri dishes, but also in liquid culture e.g. in biore- 
actors. The haploid gametophyte can be grown photoautotrophic- 
ally in sterile culture and easily observed at the cellular 
level . 



Another major advantage of mosses is their transformation capa- 
city: Despite numerous studies, the ratio of targeted integra- 
tion events in plants hardly reaches 10"*, which prevents the 
general use of gene targeting approaches for plant functional 
genomics. In contrast to all other plants having been tested so 
far, integration of homologous DiNA sequences in the genome of 
mosses (especially the established moss model organisms such as 
Physcomitrella patens (for a review of its molecular genetics: 
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Reski, 1999)) occurs predominantly at targeted locations by ho- 
mologous recombination. Transformation of mosses is usually and 
easily performed via PEG-mediated uptake of plasmid DNA by pro- 
toplasts, DNA transfer by microprojectile bombardment, electro- 
poration and microinjection (Cove et al • , 1997) . Depending on 
the design of the transforming construct predominantly random or 
targeted integration occurs. 

Despite the use of mosses as scientific tools for plant 
physiology research, the use of mosses for producing recombinant 
heterologous polypeptides in moss cells has been rather limited 
so far, although efficient production methods have become avail- 
able (e,g. culturing protonema moss tissue as described in EP 1 
206 561 A) • 

A major limitation of transformation technologies in eukaryotic 
host cells, especially in animal cells or cells of higher 
plants, has always been the lack of an efficient promoter for 
high constitutive expression of foreign genes in such transgenic 
host cells. The cauliflower mosaic virus (CaMV) 35S promoter has 
been widely used for this purpose in a number of plant trans- 
formation systems, however, the CaMV 358 promoter has shown low 
activity in some plant species (specially monocots, such as rice 
(McElroy et al., 1991,)). For monocot transformation the rice 
actin 1 5' region has been used for heterologous expression of 
proteins (McElroy et al., 1991, )• Nevertheless, the continuing 
need to provide novel expression promoting means for the expres- 
sion of recombinant (foreign) polypeptides in genetically modi- 
fied eukaiYOtic host cells still exists. 

For mosses, especially for Physcomitrella patens, up to now, no 
homologous (in this case homologous is defined ajs: moss derived) 
suitable nucleus derived expression promoters or other nucleus 
derived expression promoting sequences have been published so 
far (Holtdorf et al., 2002). Researchers have therefore used 
heterologous (in this case heterologous is defined as: not moss 
derived) promoters for the expression of selection marker genes 
and other genes of interest. However, only a few of such pro- 
moters have been reported to function reliably in certain mosses 
(e.g. the CaMV 35S-promoter; summarised in Holtdorf et al.. 
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2002; CaMV 35S-proinoter does not work in certain other species 
(Zeidler et al., 1999)), Therefore, other means for genetically 
manipulating mosses have been developed in tbe art, e.g, gene- 
trap and enhancer trap systems (Hiwatashi et al., 2001; however, 
also using CaMV 35S promoter) . 

Whereas in the above mentioned research in mosses using homo- 
logous recombination the use of heterologous promoters is ne- 
cessary (and therefore homologous promoters are not needed, 
moreover they are in most cases not useful) , the need for a 
suitable moss derived expression promoting means for industri- 
ally using mosses for the production of recombinant polypeptides 
or for the overexpression of homologous polypeptides is present 
and yet xmsolved. Such expression promoting means should allow a 
stable and constitutive expression under the applied culturing 
conditions and should preferably enable a comparable or even 
higher expression performance as the CaMV 35S promoter. 

Therefore, the present invention provides an isolated nucleic 
acid molecule encoding a moss expression promoting region 
(MEPR) . With the present invention moss derived expression re- 
gions are provided which allow a constitutive expression in ge- 
netically modified host cells, especially mosses, thereby 
addressing the needs for such tools raised in the prior axt 
(Holtdorf et al., 2002; Schaefer et al., 2002). 
An essential feature of the MEPRs according to the present in- 
vention is also that the expression promoting activity of the 
MEPRs is at least 30 %, preferably at least 50 %, of the expres- 
sion promoting activity of a working heterologous promoter in 
the specific host cell (e.g. CaMV 35S for the expression of a 
recombinant polypeptide in Physcomitrella patens), because moss 
promoters which do not have such an expression promoting activ- 
ity cannot be properly used for solving the objects of the 
present invention and are therefore not regarded as MERPs . 

According to the present invention MEPRs promoting expression 
which is not sphorophyte specific, are defined as constitutive 
MEPRs, preferably MEPRs promote expression in gametophyte de- 
rived cells, more preferably MEPRs promote expression in pro- 
tonema cells. 
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According to the present invention constitutive expression is 
preferably defined as the expression of a protein resulting in 
detectable amounts of this protein \inder liquid culture condi- 
tions generally used for photoautotrophically grown mosses, e.g. 
flask cultures, bioreactor cultures (EP 1 206 561 A) conditions 
used for the transient expression system described beneath. 
Therefore, constitutive expression has to be given for the MERPs 
according to the present invention preferably without the need 
of specific culturing additives, preferably also without the 
need of added sugars, phytohormones or mixtures of such stxb- 
stances in the culture medium. The constitutive expression has 
to be performed in a steady mode; yet it can be transients 

The terms ^moss" or ^mosses" as used in the present specifica- 
tion encoKcpasses all bryophytes (hepatics or liverworts, hom- 
worts and mosses) . Characteristic for mosses is their 
heteromorphic Generatlonswechsel , the alternation of two genera- 
tions which are distinct from each other in terms of nuclear DNA 
amounts and morphology- The diploid sporophyte is photosynthet- 
ically active only in its youth and requires supply from the 
dominating, green, haploid gametophyte. The gametophyte exdsts 
in two morphologically distinct forms: the juvenile gametophyte, 
called protonema and the adult gametophyte, called gametophor. 
In contrast to the protonema, the adult gametophyte 
( garnet ophore) bears the sex organs. 

In the context of the presented invention transient expression 
is defined as introduction of an episomal nucleic acid-based 
construct (e.g. MEPRs and gene of interest) as descibed below 
into a moss protoplast and causing or allowing transient expres- 
sion from the vector that results preferably in turn to the se- 
cretion of extracellular protein into the medium. Protoplasts 
are derived from moss cells, preferably, from gametophytic 
cells, more preferably from protonema cells. 

Although the MEPRs according to the present invention may be 
taken from any moss species, the MEPRs are preferably isolated 
from common model moss species. The MEPRs are therefore prefer- 
ably isolated from Physcomitrella, Funaria, Sphagnxun, Ceratodon, 
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Marchantia and Sphaerocarpos , especially of Physcomitrella 
patens, Funaria hygrometrica and Marchantia polymorpha. 

Suitable MEPRs according to the present invention are selected 
from the Seq. ID Nos^ 1 to 27 or expression promoting fragments 
thereof • An ""eaqpression promoting fragment" is a fragm^it of an 
MEPR which has an expression promoting activity of the MEPRs of 
at least 30 %, preferably at least 50 %, of the expression pro- 
moting activity of a working heterologous promoter in the spe- 
cific host cell (e*g* CaMV 35S for the expression of a 
recombinant polypeptide in Physcomitrella patens) . 

The MEPRs according to the present invention may comprise spe- 
cific regions, such as a promoter region ( promo t er , 
5 • tmtranslated regions ( ^5 • -UTRs" ) , 5 " -introns or 3 ' -UTRs • For 
some MEPRS/ expression promoting fragments exist which only con- 
tain the 5'-intron, Usually the promoter is always active alone 
as an expression promoting fragment. Therefore, the MEPR accord- 
ing to the present invention preferably comprises a moss pro- 
moter and preferably a 5'-UTR region and/or a 5'-intron and/ or a 
3'-UTR • 

Although it is often sufficient, if a certain constitutive ex- 
pression is reached, it is in many cases preferred to achieve a 
high expression rate, especially for industrially producing re- 
combinant polypeptides. Most of the MEPRs according to the 
present invention have proven to allow significantly higher ex- 
pression rates for a given recoinbinant polypeptide than the CaMV 
35S promoter, especially in homologous systems (e.g. a Phy- 
scomitrella MEPR for expression of a polypeptide in Phy- 
scomitrella) • Therefore, preferred MEPRs according to the 
present invention have an expression promoting activity being at 
least eciual to the expression promoting activity of cauliflower 
mosaic virus (CaMV) 35S promoter, especially, but not limited, 
in the moss species from which the MEPR was isolated. Even more 
preferred MEPRs have an expression promoting activity being at 
least 200 %, preferably being at least 500%, especially being at 
least 1000 %, of the expression promoting activity of cauli- 
flower mosaic virus (CaMV) 35S promoter, especially, but not 
limited, in the moss species from which the MEPR wias isolated. 
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The isolated nucleic acid molecules according to the present in- 
vention are preferably used to transform a specific host cell 
for producing a recombinant transgenic polypeptide, preferably, 
but not limited to, in an industrial scale. Therefore the nucle-- 
ic acid molecule is provided as a suitable vector allowing 
transformation and expression of the transgene in the host cell. 
Among the possibility that an MEPR according to the present in- 
vention is used for replacing a natural promoter in mosses, 
thereby bringing the expression of a homologous moss polypeptide 
imder the control of a MPER being located at a position in the 
genome of the moss, where it is normally not present in wild 
type strains, the prevalent industrial applicability of the 
present MEPRs is the control of eacpression of a heterologous 
(^foreign") gene in a production host cell, specifically a plant 
cell, especially a moss cell. Therefore, the nucleic acid mo- 
lecule according to the present invention further comprises a 
coding region for a recombinant polypeptide product, said coding 
region being under the control of the MEPR^ 

It is also advantageous, if the isolated nucleic acid molecules 
according to the present invention further comprises a selection 
marker and/or further i^egions necessary for enabling the appro- 
priate transformation method chosen (see e.g. Cove et al., 1997; 
Schaefer, 2002) . For example, if targeted integration is pre- 
ferred, the nucleic acid molecule according to the present in- 
vention should further comprise sequences which are homologous 
to genomic sequences of the species to be transformed. Thus, al- 
lowing targeted integration of the isolated nucleic acid mo- 
lecule via homologous recombination into the genome of the 
species to be transformed. 

Moreover, the isolated nucleic acid molecixles according to the 
present invention can be used for screening and defining con- 
sensus sequences for expression promoting regions. Finding and 
screening for such consensus sequences (regions, boxes) which 
are important and/or essential for expression promoting activity 
is a valuable asset in recombinant DNA technology, especially 
with respect to industrial biotechnology using mosses. 



According to another aspect, the present invention also relates 
to a process for the expression of a recombinant polypeptide 
product in an eukaryotic host cell comprising the following 
steps : 

providing a recoitibinant DNA cloning vehicle coinprising an 
isolated nucleic acid molecule encoding an MEPR according to the 
present invention and optionally a coding region for said recom- 
binant polypeptide product, said coding sequence being under the 
control of the MEPR of said nucleic acid molecule in said host, 

- transforming said eukaryotic host cell which does not naturally 
harbour said coding sequence in a way that it is under the con- 
trol of said MEPR, 

- culturing the transformed eukaryotic host cell in a suitable 
cultvire medium, 

- allowing expression of said recombinant polypeptide and 

- isolating the expressed recombinant polypeptide. 

As mentioned above, MEPRs according to the present invention in 
principle have the capability to achieve constitutive expression 
in various cell types, the eukaryotic host cell is preferably 
selected from plant cells, preferably moss cells, especially 
Physcomitrella patens cells. 

A system which is specifically preferred for the present inven- 
tion is the culturing in moss protonema cultures (protonema moss 
tissue) . In doing so the method described in the EP 1 206 561 A 
and the preferred embodiments thereof are explicitly incorpor- 
ated by reference herein and are iinmediately applicable to the 
present invention. 

The constitutive expression of the polypeptide with the means 
according to the present invention is possible without the need 
for various additives in the culture mediiam, specifically 
without additives for specific differentiation or promoting dif- 
ferent tissue growth. Therefore, besides electrolytes, selection 
agents and medium stabilisers, the culture medium preferably 
does not contain any further additives for cell supply. The cul- 
ture mediijm for stably transformed plants is preferably free 
from added sugars, phytohormones or mixtures thereof. The cul- 
ture medium for transiently transformed protoplasts is prefer- 
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ably free from added phytohormones • 

Preferred moss cells are moss cells of the group Physcomitrella, 
FiHiaria, Sphagnxam, Ceratodon, Marchantia and Sphaerocarpos , es- 
pecially in protonema cultxires* 

According to another aspect, the present invention also provides 
the use of an isolated nucleic acid molecule encoding an MEPR 
for industrially producing a polypeptide, especially for provid- 
ing recombiaaant cells producing said polypeptide. The industrial 
production allows a large scale preparation of a given poly- 
peptide of interest in bioreactors, e.g. in gram amoxints or even 
higher (commercial yields) • This in contrast to the production 
sufficient for research use (mg amounts) or analytical purposes 
(vig amoxmts), which may, of course also be performed by the 
present invention. In transient expression systems, protein 
amounts sufficient for such analytical purposes can easily be. 
obtained with the present DNA molecules. 

Accordingly, the present invention also enconpasses the use of 
an isolated nucleic acid molecule encoding a MEPR for expression 
of a moss polypeptide, the expression of said moss polypeptide 
being not naturally controlled by said MEPR, especially for 
providing recombinant moss cells expressing said polypeptide. 
This use may be reduced to practice both, for research purposes 
and for industrial scale production of moss polypeptides. 

According to another aspect, the present invention also provides 
the use of an isolated nucleic acid molecule encoding a MEPR for 
expression of proteins involved in specific posttranslational 
modifications (e.g. glycosyltransf erases) , especially for 
providing recombinant moss cells expressing polypeptides with 
posttranslational modifications normally not existing or nor- 
mally existing in another ratio in untransformed moss cells. 

According to another aspect, the present invention also provides 
the use of an isolated nucleic acid molecule encoding a MEPR for 
expression of proteins involved in metabolic pathways, espe- 
cially for providing recombinant moss cells altered in their 
contents of metabolites e.g. secondary metabolites. 
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According to another aspect, the present invention aisp provides 
the use of an isolated nucleic acid molecule encoding a MEPR for 
expression of antisense molecules, siRNA molecules or ribozymes 
especially for providing recombinant moss cells with reduced 
amoxints of specific proteins resulting in altered phenotypes 
e.g, morphologically, biochemically. 

According to another preferred aspect, the present invention 
also relates to the use of an isolated nucleic acid molecule en- 
coding an MEPR according to the present invention for recombin- 
ant expression of postranslationally modifying proteins, 
especially for the production of posttranslationally modified 
proteins. With such a technology, it is possible to produce pro- 
teins which are specifically modified postranslationally 
(differently than in the native host cell, thereby enabling e.g. 
plant cells or moss cultures to allow the production of proteins 
with e.g. mammal or even hioman glycosylation patterns. Examples 
wherein such techniques are applied with specific glycosyl trans- 
ferases are described e.g. in WO .00/49153 A and WO 01/64901 A. 

Another preferred use of the isolated nucleic acid molecule en- 
coding an MEPR according to the present invention relates to the 
in vitro expression of recombinant proteins. The technique of in 
vitro translation allows a more controlled production of the re- 
combinant product without the need to accept the xincertainties 
being connected with host cells. 

Another preferred use of the nucleic acid molecule according to 
the present invention is their use for recombinant expression of 
metabolism modifying proteins, e.g. proteins which modify the 
(posttranslational) modification of a translated amino acid 
chain (see e.g. Berlin et al, 1994) . 

The present invention is further illustrated by the following 
examples and the figures, yet without being restricted thereto. 

Figures : 

Fig.l fi- tubulin genes in Physcomitrella patens. 
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Fig. 2 Analysis of expression promoting regions of IS- tubulins in 
Physcomitrella patens. 

Fig. 3 Analysis of expression promoting regions of Ppttib 1 by 

transient transformation of rhVEGF constructs. 

Pig. 4 Analysis of expression promoting regions of Pptub 2 by 

transient transformation of rhVEGF constructs. 

Fig. 5 Analysis of expression promoting regions of Pptub 3 by 

transient transformation of rhVEGF constructs. 

Fig. 6 Analysis of expression promoting regions of Pptub 4 by 

transient transformation of rhVEGF constaructs. 

Fig. 7 Genomic structure of Physcomitrella patens actin genes ^ 

Fig. 8 Comparison of the esqpression activity of the different 

5 'actin regions. 

Fig. 9 Ppactl constructs. 

Fig. 10 Ppact 5 constructs, 

Fig. 11 Ppact 7 constructs. 

Fig. 12 Pp act3::vegf constructs. 

Fig. 13 Ppactl promoter :5' intron siobstitutions , 

Fig. 14 Ppactl promoter :vegf deletion constructs. 

Fig. 15 Ppact3 promoter : vegf deletion constructs. 

Fig. 16 PpactS promoter : vegf deletion constructs. 

Fig. 17 Ppact 7 promoter : vegf deletion constructs. 

Fig. 18 Actin genes in various moss species, and 

Fig. 19 Comparison of promoter sequences of homologous actin 

genes from Physcomitrella patens and Fxmaria hygrometrica 



Haterial and Methods 



Plant: material 



Physcomitrella patens {Hedw. ) B.S.G. has been characterised pre- 
viously (Reski et al. 1994)) . It is a subculture of strain 16/14 
which was collected by H.L.K. Whitehouse in Gransden Wood, Hunt- 
ingdonshire, UK and was propagated by Engel (1968; Am J Bot 55, 
438-446) . 



Standard culture conditions 



Plants were grown axenically under sterile conditions in plain 
inorganic liquid modified Knop medixam (1000 mg/1 Ca(W03)2 x 4 HjO 
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250 mg/1 KCl, 250 mg/1 KH2PO4/ 250 mg/1 NgSO* x 7 H2O and 12,5 
mg/1 FeS04 X 7 H2O; pH 5.8 (Reski and Abel (1985) Planta 165, 
354-358) . Plants were grown in 500 ml Erlenmeyer flasks contain- 
ing 200 ml of culture medium or on 9 cm Petri dishes with solid- 
ified Knop medixom (lOg/1 agar) • Flasks were shaken on a Certomat 
R shaker (B.Braun Biotech International, Germany) set at 120 
rpm* Conditions in the growth chamber were 25 +/- 3^C and a 
light-dark regime of 16:8 h. Cultures were illuminated from 
above by two fluorescent tubes (Osram L 58 W/25) providing 35 
micromol/m-^s"^. Subculturing of liquid cultures was done once a 
week by disintegration using an Ultra-Turrax homogenizer (IKA, 
Staufen, Germany) and inoculation of two new 500 ml Erlenmeyer 
flasks containing 100 ml fresh Knop medium. Additionally, cul- 
tures were filtered 3 or 4 days after disintegration and were 
transferred into fresh Knop mediiam. 

Bioreactor cultures were grown in Knop medium or in 1/10 Knop 
medium, respectively, in stirred tank glass bioreactors 
(Aplikon, Schiedam, The Netherlands) with a working volxime of 5 
liters (as described in Hohe and Reski, Plant Sci. 2002, 163, 
69-74) . Stirring was performed with a marine impeller rxmning 
with a speed of 500 rpm, the cultures were aerated with 0.3 wm 
[(aeration voltame) / (medium volume) /min] air. The cultxare temper- 
atirce of 25^C in the vessel was controlled by a doiable jacket 
cooling system. Light intensity was 50 micromol/m'^^s'^ provided by 
fluorescent tubes (Osram L 8W/25) with a light/dark rhythm of 
16/8 h. The pH- value in the culutures (pH 6.5 - 7.0) was not ad- 
justed. 

Protoplast; isolation 

Different protocols for the isolation of protoplasts (Grimsley 
et al. 1977; Schaefer et al. 1991; Rother et al, 1994; Zeidler 
et al. 1999; Hohe and Reski 2002; Schaefer 2001) have been de^ 
scribed for Physcomitrella patens. For the work presented 
herein, a modification/ combination of the previously described 
methods was used: 

Moss tissue was cultivated for 7 days in Knop medixm with re- 
duced (10%) Ca(N03)2 content. Cultures were filtered 3 or 4 days 
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after disintegxation and were transferred into fresh Rciop medixim 
with reduced (10%) Ca(N03)2 content* After filtration the moss 
protonemata were preincxxbated in 0*5 M rtiannitol. After 30 min, 
4% Driselase (Sigma^ Deisenhofen, Germany) was added to the sus- 
pension. Driselase was dissolved in 0*5 M mannitol (pH 5.6-5*8), 
centrifuged at 3600 rpm for 10 min and sterilised by passage 
through a 0.22 filter (Millex GP, Millipore Corporation, 
USA) . The suspension, containing 1% Driselase (final concentra- 
tion) , was incubated in the dark at RT and agitated gently {best 
yields of protoplasts were achieved after 2 hours of 
incubation) . The suspension was passed through sieves (Wilson, 
CLF, Geimany) with pore sizes of 100 micrometer and 50 micromet- 
er. The suspension was centrifuged in sterile centrifuge tubes 
and protoplasts were sedimented at RT for 10 min at 55 g 
(acceleration of 3; slow down at 3; Multifuge 3 S-R, Kendro, 
Geirmany) . Protoplasts were gently resuspended in W5 medium (125 
mM CaCla X 2 H2O; 137 inM NaCl; 5.5 itOyE glucose; 10 mM KCl; pH 5.6; 
660-680 mOsm; sterile filtered; Menczel et al. 1981). The sus- 
pension was centrifuged again at RT for 10 min at 55 g 
(acceleration of 3; slow down at 3; Multifuge 3 S-R, Kendro, 
Germany) . Protoplasts were gently resuspended in W5 medium. For 
coiinting protoplasts a siaall volume of the suspension was trans- 
ferred to a Puchs-Rosenthal-chamber . 

Transient Transf ozmatlon 

Different protocols for transformation (Schaefer et al. 1991; 
Reutter and Reski 1996, Schaefer 2001) have been described for 
Physcomitrella patens. For the work presented herein, a modific- 
ation/ coitODination of the previously described methods was used: 

For transformation protoplasts were incubated on ice in the dark 
for 30 minutes. Subsequently, protoplasts were sedimented by 
centrifugation at RT for 10 min at 55 g (acceleration of 3; slow 
down at 3; Multifuge 3 S-R, Kendro). Protoplasts were resxxspen- 
ded in 3M medium (15 inM CaCl2 x 2 H2O; 0.1% MES; 0.48 M mannitol; 
pH 5.6; 540 mOsm; sterile filtered, Schaefer et al. (1991) Mol 
Gen Genet 226, 418-424) at a concentration of 1.2 x 10^ proto- 
plasts/ml. 250 microliter of this protoplast suspension were 
dispensed into a new sterile centrifuge tube, 50 microliter DNA 
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solution (colxiinn purified nNA in H2O (Qiagen, Hilden, Germany, 
Hilden, Germany) ; 10-100 microliter optimal DNA amoixnt of 60 mi- 
crogram was added and finally 250 microliter PEG-solution (40% 
PEG 4000; 0.4 M mannitol; 0.1 M Ca(N03)2; pH 6 after autoclaviAg) 
was added. The suspension was immediately but gently mixed and 
then inciobated for 6 min at IIT with occasional gentle mixing. 
The suspension was diluted progressively by adding 1, 2, 3 and 4 
ml of 3M medium. The suspension was centrifuged at 20°C for 10 
minutes at 55 g (acceleration of 3; slow down at 3; Multifuge 3 
S-R, Kendro) . The pellet was resuspended in 400 microliters 3M 
medium. Cultivation of transformed protoplasts was performed in 
48 well plates (Cellstar, greiner bio-one, Frickenhausen, Ger- 
many) . 

Transient transformations were incubated in dim light (4.6 mi- 
crcMaiols-lm-2) at 25oc. Samples were tcLken after 24h and 48h, re- 
spectively, by carefully replacing half of the medium (200 
microliters) by fresh medium. The medium was not replaced com- 
pletely since the protoplasts have to be kept in liquid. The re- 
moved medivim (including recombinant protein) was stored at 
-200c. The 48h samples were measured in an ELISA. 

Stable fcransfonnafcion 

Different protocols for trans fortnat ion (Schaefer et al. 1991; 
Reutter and Reski 1996, Protocol Schaefer 2001) have been de- 
scribed for Physcomitrella patens. For the work presented 
herein, a modification/ combination of the previously described 
methods was used; 

For trans foinnat ion protoplasts were inciabated on ice in the dark 
for 30 minutes. Subsequently, protoplasts were sedimented by 
centrifugation at RT for 10 min at 55 g (acceleration of 3; slow 
down at 3; Multifuge 3 S-R, Kendro) . Protoplasts were resuspen- 
ded in 3M medivmi (15 im CaCla x 2 H2O; 0.1% MES; 0.48 M mannitol; 
pH 5.6; 540 mOsm; sterile filtered, Schaefer et al. (1991) Mol 
Gen Genet 226, 418-424) at a concentration of 1.2 x 10* proto- 
plasts/ml. 250 microliter of this protoplast suspension were 
dispensed into a new sterile centrifuge tiabe, 50 microliter DMA 
solution (column purified DNA in H2O (Qiagen, Hilden, Germany, 
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Hilden, Germany) ; 10-100 laicroliter optimal DNA amoimt of 60 mi- 
crogram was added and finally 250 microliter PEG-solution {40% 
PEG 4000; 0.4 M mannitol; 0.1 M Ca(N03)2; PH 6 after autoclaving) 
was added. The suspension was immediately but gently mixed and 
then incTobated for 6 min at RT with occasional gentle mixing. 
The suspension was diluted progressively by adding 1, 2, 3 and 4 
ml of 3M medium. The suspension was centrifuged at 20^C for 10 
minutes at 55 g (acceleration of 3; slow down at 3; Multifuge 3 
S-R, Kendro) . The pellet was re-suspended in 3 ml regeneration medi- 
um. Selection procedure was performed as described loy Strepp et al. 
(1998) . 

EIiXSA 

Recombinant VEGF121 expressed by transient transformed moss pro- 
toplasts was quantified by ELISA (R&D Systems, Wiesbaden, Ger- 
many) . The ELiISA was performed according to the instructions of 
the manufacturer. The samples were diluted for quantification. 

Bacterial strains and cloning vectors 

For all cloning and propagation experiments Escherichia coli 
strain ToplO (Invitrogen, Karlsruhe, Geannany) was used. For 
cloning of DNA- fragments pCR2.1-TOPO (Invitrogen, Karlsrxihe, 
Germany) , pCR4-TOPO (Invitrogen, Karlsruhe, Germany) , pZErO-2 

(Invitrogen, Karlsruhe, Germany) or pRTlOl (Topfet et al. 

(1987), NAR, 15, p5890) were used as vectors. 

Genomic Z»9A$ preparaticm, digestion, ligation 

Physcomitrella patens genomic DNA was isolated from 13 days old 
protonemata following the CTAB protocol (Schlink and Reski, 
2002) . 

Genomic PNA (3-5 micrograms) was digested with 30 units of vari- 
ous restriction endonucleases (e.g. BamHI, EcoRI, Hindlll, Kpnl, 
Ncol, Ndel, Pael, PagI, Xbal; all MBI Fermentas, St* Leon-Rot, 
Germany) in a total volume of 30 microliters for two hours at 
37^0, using one endonuclease per digest. Digested DNA was puri- 
fied using PGR Purification Columns (Qiagen, Hilden, Germany) , 
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following the suppliers manual (30 microliters digest + 200 mi- 
croliters buffer PB) . Elution was done in 50 microliters Elution 
Buffer (EB; Qiagen, Hilden, Germany) . Prior further treatment, 
10 microliters of the eluate were analysed on an agarose gel 
(0,5%) • 

The remaining DNA was religated with 5 imits T4 Ligase (MBI Per- 
mentas, St^ Leon-Rot, Germciny) in a total volume of 300 microl- 
iters for two hours at RT and additional two days at 4^0. Prior 
addition of the enzyme ligation mixtures were put for five 
minutes at 50^C and then on ice, in order to melt sticky end 
basepairing. After ethanol precipitation with 0,3 M Na-acetat 
(pH 4.8) and two washes with 70% ethanol the religated DNA was 
resuspended in 200 microliters EB . One to three microliters of 
this religated genomic DNA were used for X-PCR» 

BNA Pr^aral^lon 

Physcomitrella patens total RNA was prepared by grinding tissue 
under liquid nitrogen and by the usage of E^Z.N,A. Plant RMA Kit 
(PeqLab) or RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) fol- 
lowing the suppliers manuals. Total RNAs were gel analysed, 
quantified (OD260), and stored at -20^C or -80^0, respectively. 

DNase treatment and First Strand cDNA Ssrnthesis 

1 microgram of total RNAs was DNase (GIBCO BKL) digested in a 
total volume of 11 microliters, following the suppliers manual. 
4,5 microliters of this DNase treated total RNA (~400ng) was 
used with Oligo dT (12-18) primers and SUPERSCRIPT II RNase H Re- 
verse Transcriptase (GIBCO BRL) to prepare first strand cDNA, 
following the suppliers manual. The resulting cDNA was 10 times 
diluted with sterile ddHaO and stored at -20^C. 

PGR in generaX 

If not indicated in particular PCRs were done with Advantage 
CDNA Polycaerase Mix (BD Biosciences Clontech, Heidelberg, Ger- 
many) . For all other PCR-approaches the following DNA poly- 
merases were used: Taq recombinant polymerase (MBI Ferment as. 
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St. Deon-Rot, Germany), Pfu native polymerase (MBI Fermentas, 
St. Leon-Rot, Germany), Platinum Pfx DNA polymerase (Invitrogen, 
Karlsruhe, Germany) or TripleMaster PGR System {Eppendorf , Ham-- 
burg, Germany) . Licenced Thermo -cyclers were Mastercycler gradi- 
ent (Eppendorf, Hamburg, Germany) . All primers were synthesised 
by MWTG Biotech AG, Ebersberg, Germany. For PGR product purifica- 
tion or gel elution GPX PGR DNA and Gel Band Purification Kit 
(Amersham Bioscience, Freiburg, Germany) was used, following the 
suppliers manual. 

Construction and Cloning o£ Recondbinant Plasmids 

Conventional moleculcir biology protocols were essentially as de- 
scribed by Saitibrook et al. (1989), Molecular Cloning; A r,aborat- 
ory Manual, 2nd edn. Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press. 

Inverse PCR (X-PCR) & nested PCR 

I-PCR was done with 0.25 microliters Advantage cDNA Polymerase 
Mix and buffer (including 3,5 mM Mg(OAc)2, both BD Biosciences 
Clontech, Heidelberg, Germany), 0.2 irtM each primer, 0.2 mM dNTPs 
and one to three microliters of genomic religations (see above) 
in a toatl volxame of 25 microliters. Cycling conditions were: an 
initial step of 2 minutes at 96°C, then 20 seconds 96^C, 10 
seconds initially 67 ^C (touchdown: -0 .15^C/cycle) and 10 minutes 
68^C as a second step, with 35 to 40 repetitions, followed by a 
terminal step of 20 minutes at 68^C and cooling to A^C at the 
^d of the program • PCR products were eluted from agarose gels. 
Elution was done in 30 microliters. Eluted PCR products were 
either cloned directly in TOPO TA vectors (pCR4-TOPO, Invitro- 
gen, Karlsruhe, Germany) or used as template for reconfirmation 
in nested PCRs. In the latter case gel eluted, nested PCR 
products were cloned in TOPO TA vectors (pCR4-TOPO, Invitrogen, 
Karlsruhe, Germany), Cycling conditions for nested PCRs were: an 
initial step of 1 minutes at 96^C, then 20 seconds 94^C, 10 
seconds 56^C and 4 minutes 68^C as a second step, with 25 repe- 
titions, followed by a terminal step of 10 minutes at 68 ^C. 

Generation of pRTlOlnew for cloning of aiqplif ied promoter frag- 
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ments 

pRTl01p21 (Gorr 1999) was reaiaplified with Pfu native polymerase 
(MBI Fermentas, St. Leon-Rot, Germany) using primer 320 and 321 
(for this and all subsequent primers see Table 1) . Primer 320 

(forward) starts at the 2nd codon (5'-(atg)aac ) of the VEGF 

signal peptide. Primer 321 (reverse) starts in the middle of the 
Hindi site within the multiple cloning site in front of the 35S 
promoter (5'-gac...). An additional Xhol site was introduced 
with primer 321. Religation of the PGR product resulted in loss 
of the 35S promoter and the reconstitution of a HincIX site. 
The sequence of the VEGF gene was verified by sequencing. This 
new vector was called pRTlOlnew and used for cloning of expres- 
sion promoting regions via the Xhol or Hindi site, respect- 
ively, in front of the reporter gene. 

Seoaencing 

All sequencing reactions were performed by SEQIiAB Sequence 
Laboratories, GQttingen, Germany 

Software 

Sci Ed Central, Clone Manager Suite were used for primer design, 
pairwaise and multiple sequence alignments. Ijasergene, DNASTAR 
(Version 5) Megalign and Se<SSfen was used for analysing sequen- 
cing data. Homology searches were carried out by BLAST 2 
(Altschul et al., 1997). 

EXAMPLES 

The present invention is illustrated by four examples for moss 
expression promoting regions: first, the isolation and analysis 
of various members of a family of txabulin e^^pression promoting 
regions of Physcomitrella patens. In the second exairple expres- 
sion promoting regions for the actin gene family from a variety 
of different mosses are provided. The third and fourth example 
deals with ubiquitin expression promoting regions and with RBCS 
expression promoting regions. 
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EX2\MPXi£ 1: Cloning and analysis of Physcomitrella patens fi-t:u-* 
bulin genes and their expression promoting regions. 

Overview 

In order to get S-tixbulin (tixb) regulatory /promoter sequences 
from Physcomitrella patens (Pp) in a first step coding sequences 
of B-tiibulin homologues were isolated by polymerase chain reac- 
tion (PGR) . Therefor an alignment of all nine published tu- 
bulin genomic sequences from Arabidopsis thaliana (Attoxb 1-9) 
were used to design primers within highly conserved coding re- 
gions (8F, 9F and lOR; for this and all svibsequent primers see 
Table 1) • In addition, sequence information of public EST data 
from Physcomitrella patens were used, but only three did show 
homologies to S-tubulins . One of which was used to design a 
gene-specific primer (F7) upstream of the predicted coding re- 
gion. Sequence comparison of all cloned PGR products, generated 
with the primers mentioned and EST data lead to 3 groups of 
clones with identical DNA within but differences between groups, 
mainly, but not exclusively, due to differences within introns. 
This S-txibulin orthologues were named Pptub 1, Pptub 2 and Pptub 
3 , respectively . 

Furtheannore, since during the running project, more EST data 
were available (more than 50000 new entries in NGBI/dbEST with 
beginning of 2002), a detailed analysis of all 121 Phy- 
scomitrella patens ESTs with high similarity to S-tubulin lead 
to three additional new upstream and three downstream groups of 
ESTs, being identical within a group but neither identical to 
any other group nor to Pptixb 1-3 . PGR with primers derived from 
predicted noncoding upstream and downstream regions (see below) 
from each new group and permuting all primer combinations helped 
to correlate corresponding upstream and downstream groups to a 
particular locus, named Pptub 4, Ppttib 5 and Pptub 6, respect- 
ively. Both, genomic and cDNA amplificates of all three new loci 
were cloned and sequenced, raising the nuinber of S-tubulin or- 
thologues in Physcomitrella patens to six. 

Pptub 1 to 4 (in contrast to Pptxxb 5 and 6) are much more fre- 
quently represented in EST databases. Corresponding cDNA librar- 
ies were produced using RNA mainly from protonema and young 



- 19 - 

gametophore. So, for this four genes oiHy, based on the gained 
sequence data, an inverse PGR approach (I-PCR) was performed in 
order to walk into flanking genomic regions • 

Pptub 1 

As already mentioned in a first step, Taq (MBI Fermentas, St* 
Leon-- Rot, Gexmany) PGR fragments from two independent PCRs on 
Physcomitrella patens genomic DNA using primers 8F and lOR were 
cloned. One clone (2-1) and two clones (8-1, 8-2), respectively, 
from each PGR were sequenced partially and turned out to be 
identical. The corresponding locus was named Pptub 1. 

This preliminary sequence information was used to design primers 
in order to perform a genomic walk into flanking regions of 
Pptub 1, using an I- PGR approach on religated EcoR I suid Hind 
III genomic digests (primers 35, 36) . Reconfirmation of products 
was done by nested PGR (primers 40, 38) . Two clones generated by 
nested PGR products (E#l and H !•?) were sequenced completely. 

The Hind III, clone H 1,7 did not harbour an internal Hind III 
site, most likely due to star activity of the enzyme or ligation 
of a random ds breakage. However, sequences upstream of the 
first EcoR I site were confirmed by two independent PCRs on gen- 
omic DNA {primers 113, 67 and 113, 90) • In addition, an addi- 
tional cDNA (89, 91; Pfu native (MBI Fermentas, St* Leon-Rot, 
Germany) ) PGR product was cloned. 

All mentioned clones helped to generate and reconfirm sequence 
data. In total -1500 bp upstream of the startcodon and -1500 bp 
downstream of the stopcodon were gained. 

Pptub 2 

As already described above sequence information of p\iblished 
ESTs from Physcomitrella patens was used to design a gene-spe- 
cific primer (F7) upstream of the predicted coding region. PGR 
on Physcomitrella patens genomic DNA (primers F7, ICR) and svib- 
sequent cloning and sequencing of the PGR product proofed that 
it, together with all three so far published Pptub ESTs (Pptub 
EST 1-3) belong to one locus, named Pptixb 2. Intron positions 
could be verified by comparing EST with genomic sequences. 
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This preliminary sequence information was used to design gene- 
specific primers within introns (primers 95 and 71) in order to 
perform a genomic walk into adjacent genomic regions of Ppttib 2, 
using an I-PCR approach on religated Pag I, BamH I and Nde I ge- 
nomic digests. PGR products were reconfirmed by nested PGR 
(primers 38, 35) ♦ Two clones generated by nested PGR products 
{G#2Pag and D#2Nde) were sequenced completely. The Nde I clone 
D#2 did not harbour an internal Nde I site^ most likely due to 
star activity of the enzyme or ligation of a random ds breakage. 
However, sequence data were confianned by C#2Pag and a third I- 
PGR clone (95#8BamHI; primer 149 and 71) . In addition two inde- 
pendent PGRS on genomic DNA (primers 205, 149; Taq (MBI Per- 
mentas, St. Leon-Rot, Germany) and primers 205, 206) confirmed 
product length. The 205-206 PGR product and an additional genom- 
ic downstream PGR product (primers 71, 2 06; Pfu native (MBI Fer- 
mentas, St. Leon-Rot, Germany)) were cloned and helped to verify 
sequence data. 

All mentioned clones helped to generate and reconfirm sequence 
data. In total -1400 bp upstream of the startcodon and --1400 bp 
downstream of the stopcodon were gained. 

Pptiib 3 

As already mentioned in a first step, Taq (MBI Fermentas, St. 
Leon-Rot, Germany) PGR fragments from two independent PGRs on 
Physcomitrella patens genomic and cDNA using primers 9P and lOR 
were cloned. Clones from each PGR (#3-3 genomic, #4-3 cDNA) 
were sequenced partially and turned out to be identical. The 
corresponding locus was named Pptub 3 . 

This preliminary sequence information was used to design gene- 
specific primers within introns (primers 69, 70) in order to 
perform a genomic walk into adjacent regions of Pptub 3, using 
an I -PGR approach on religated Pag I and Nco I genomic digests. 
Reconfirmation of PGR products was done by nested PGR (primers 
38, 35). Two clones (A#lNco and #4-lPag) were sequenced com- 
pletely. A#lNco is a clone generated by a nested PGR product 
(38, 35) whereas #4-lPagl was generated by the original I-PGR 
product (69, 70) . In addition a genomic PGR product (primers 
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203, 204) was cloned and helped to verify sequence data. 

All mentioned clones helped to generate and reconfirm sequence 
data. In total -1900 bp upstream of the startcodon and -1100 bp 
downstream of the stopcodon were gained. 

Pptiib 4 

As already mentioned, in case of Pptxib 4, EST data were used to 
design gene-specific downstream and upstream primers (297, 299) 
in order to generate genomic and cDNA clones. Additional genomic 
clones using native Pfu polymerase (MBI Fearmentas, St. Leon-Rot, 
Germany) helped to verify sequence data. 

Primer 297 and 299 were inverted (337, 383) and used to perform 
a walk into adjacent genomic regions of Pptxxb 4, using an I-PCR 
approach on religated Nde I and Nco I genomic digests. Two 
clones (48#2Nco and A02#3Nde) and additional genomic clones 
(primers 547 and 374; Advantage cDNA Polymerase Mix (BD Bios- 
ciences Clontech, Heidelberg, Germany) and Triple Master 
(Eppendorf, Hamburg, Germany)) were generated. 

All mentioned clones helped to generate and reconfirm sequence 
data. In total -2300 bp upstream of the startcodon and -1100 bp 
downstream of the stopcodon were gained. 

Fp1:ub 5 and 6 

As already mentioned, in case of Pptub 5 and 6, EST data were 
used to design gene-specific downstream and upstream primers 
(Pptxab 5: 298, 300 and Pptub 6: 296, 336) in order to generate 
genomic and cDNA clones of each gene. In case of Pptxib 5, addi- 
tional genomic clones using native Pfu polymerase (MBI Fer- 
mentas, St. Leon-Rot, Germany) helped to verify sequence data. 

All mentioned clones helped to generate and reconfirm sequence 
data. In total 2031 bp genomic sequence for Pptub 5 and 3161 bp 
genomic sequence for Pptub 6 were gained. 

Cloning strategies 

Preliminary Pptub 1 (2-1, 8-1, 8-1; all genomic) and Pptub 3 (3- 
3 genomic, 4-3 cDNA) clones were generated with Taq recombinant 
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polymerase. PGR products were ligated into TOPO TA vectors 
(pCR4-TOPO, Invitrogen, Karlsnihe, Germany) • PGR conditions 
were: 2.5 unit Taq recombinant polymerase, enzyme buffer, 3.3 mM 
MgGla {all MBI Fermentas, St. Leon-Rot, Germany), 0.4 mM each 
primer, 100 nanograms of cDNA or genomic DNA as template in a 
total volume of 25 microliters. Cycling conditions were: an ini- 
tial step of 5 minutes at 95^C, then 45 seconds 95 ^C, 10 seconds 
60^C (primer 8P) or 65^C (primer 9F) and 1 minute 12^C as a 
second step, with 30 to 35 repetitions, followed by a terminal 
step of 5 minutes at 72<>C and cooling to at the end of the 

program. 

All other genomic and cDNA clones were 



Pptub 1: 
Pptub 2; 
Pptub 3: 
Pptub 4 J 
Pfu) 

Pptub 5: 
Pptub 6: 



113-67 . 113-90 , 89-90 . 89-91 cDNA 

F7/R10 . 205-206 . 71-206 

203-204 

547-374 (+ TrippleM^ster) , 297-299 cDNA + genomic ( + 

298-300 CDMA + genomic (+ Pfu) 
296-336 CDNA + genomic 



Underlined clones above were generated with. Advantage cPNA Poly- 
merase Mix, using 0.25 microliters enzyme mix, buffer (including 
3,5 mM Mg(0Ac)2, both BD Biosciences Glontech, Heidelberg, Ger- 
many) , 0*25 mM each primer, 0*25 mM dNTPs and 10-20 nanograms of 
template per 20 microliter PGR. Cycling conditions were: an ini- 
tial step of 2 minutes at 96^C, then 20 seconds 96^0, 10 seconds 
60^C and 2 minutes/kb 68®C as a second step, with 35 to 40 repe- 
titions, followed by a terminal step of 15 minutes at 68 and 
cooling to 4**C at the end of the program. PGR products of ap- 
propiate length were eluted from agarose gels, Elution was done 
in 30-50 microliters, depending on amoxmt of amplificate. Eluted 
PGR products were cloned in TOPO TA vectors (pCR4- 
TOPO, Invitrogen, Karlsruhe, Germany) . 



All other clones were generated with pfu native polymerase, as 
were the two additional genomic clones 297-299 and 298-3 00, us- 
ing 0,3 microliters polymerase (= 0.75 imits) , buffer, 2-4 mM 
MgS04 (all MBI Permentas, St. Leon-Rot, Germany), 0.25 niM each 
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primer, 0.2 iriM dNTPs and 10-20 nanograitis of template per 20 mi- 
croliter PGR. Cycling conditions were: an initial step of 2 
minutes at 96^C, then 20 seconds 96^C, 10 seconds 60^C and 2 
minutes /kb 72^C as a second step, with 35 to 40 repetitions, 
followed by a terminal step of 10 minutes at 72^C and cooling to 
4°C at the end of the program • PGR products of appropriate 
length were eluted from agarose gels, Elution was done in 30-50 
microliters, depending on amount of amplificate. Eluted PGR 
products were cloned in pZErO-2 (Invitrogen, Karlsruhe, Germany) 
linearised with EcoRV. 

An additional clone of 547-374 was generated with the TripleMas- 
ter PGR System, using 0.25 microliters polymerase iriix (= 1.25 
xmits), tuning buffer (including 2.5 itM Mg^*, both Eppendorf, 
Hamburg, Germany), 0.2 mM each primer, 0.2 mM dNTPs and 10-20 
nanograms of template per 20 microliter PGR. Gycling* conditions 
were: an initial step of 2 minutes at 96^G, then 20 seconds 
96^C, 20 seconds 60^C and 3 minutes 72^C as a second step, with 
40 repetitions, followed by a terminal step of 10 minutes at 
72<^C and cooling to 4^C at the end of the program. PGR products 
of appropriate length were eluted from agarose gels* Elution was 
done in 30-50 microliters, depending on amount of amplificate. 
Eluted PGR products were cloned in TOPO TA vectors (pGR4-TOPO, 
Invitrogen, Karlsruhe, Germany) . 

In summary, PGR on genomic DNA of Physcomitrella patens and 
cloning of PGR products lead to sequence information of six 
transcribed Physcomitrella patens tubulin genes • Additionally, 
EST and cDNA data were used to confirm genomic sequence data and 
intron/exon borders. In case of Pptub 1 to 4 inverse PGR lead to 
non transcribed flanking 5 ' and 3 ' genomic sequences . A general 
oveirsriew of all six genomic regions is given in Figure 1. 

Gene struct^uare & Conservation 

As already stressed, Pptiab 1 to 4 are most abimdantly represen- 
ted in EST databases. In addition the great majority of their 
corresponding ESTs were raised from full length cDNA libraries. 
This two facts helped to determine the transcriptional start 
site (TSS) of Pptub 1 to 4 in silxco. A multiple alignment of 5' 
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ESTs against corresponding upstream genomic regions showed that 
Pptub Ito 3 do have a precise transcriptional initiation: 20 out 
of 27 5' ESTs for Pptub 1, 16 out of 20 5' ESTs for Pptub 2 and 
9 out of 14 5' ESTs for Pptub 3, do start at the same, most up-- 
stream position, marked with +1 (Figure 3-6) • In addition all 
three TSSs axe surrounded by a consensus sequence (see below) . 
In case of Pptub 4 the 23 5' ESTs indicate multiple TSSs within 
100 bp. The start site of the most upstream 5' EST was defined 
as +1. 

An analogous multiple alignment of 3 ' ESTs against corresponding 
downstream genomic regions reconfirmed that plant genes almost 
always come with more than one poly (A) site and that consensus 
sequences are much less sharply defined than in e^g. mammalian 
genes, in which the sequence AAUAAA is nearly ubiquitous (for 
review see: Rothnie et al., 1996) • 

The six cloned loci of Physcomitrella patens did not show any 
nonsense stop-codons and proper proteins with high similarities 
to known IS- tubulins could be predicted. Outside the coding re- 
gions generally, the similarity drops immediately and signific- 
antly. Concerning 5' putative regulatory elements a detailed 
coHiparison of all four upstream regions revealed no overall con- 
servation within the gene family or to 5 ' regions of other known 
plant S-tvibulin genes. However, some interesting matches of con- 
servation within the gene family could be detected: 

a) The determined TSSs of Pptub 1 to 3 in all three cases fall 
within the consensus sequence T/C C A(+l) G/C T G T G C and are 
embedded in C/T-rich regions (compare consensus of 171 xinrelated 
TATA plant promoters: T/C C A(+l) N M N in plantProm Database 
under http://mendel.cs,rhul. ac.uk/mendel. php?topic==plantprom) . 

b) 22-24 bp upstream of the TSS -which is within the typical 
distance for plant TATA promoters (see plantProm DB) - a weak 8 
bp TATA box embedded in a conserved stretch of 20-25 bp can be 
found in Pptub 1 to 3 . The TATA box consensus from 171 unre- 
lated plant promoters is: Tgg A95 Tgg Aioo A62/T38 A^7 Tgi/Aaa A73 (see 
plantProm DB) and for Pptub 1-3 is: T tTATcT c/t/A^ with 
capitals indicating correlation to consensus . 

c) all four genes do have a very low degree of Adenosine (9- 
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16%) in their 5'UTRs. 

d) The 5' UTR of Pptub 4 has an overall C/T content of 74%, 
which --in addition- harbours a C/T stretch 50 i>p) , directly 
behind the start point of the shortest, most doximstreaia 5' EST, 

e) . Pptub 2 harbours a 40 bp polyA stretch aroimd 450 bp up- 
stream of the TSS (-450 until -489) • 

f) In Pptub 1 and 4 upstream of app. position -420 long very 
A/T-rich regions begin (Pptiob 1 over 80% A/T for nearly 900 bp 
and Pptub 4 75% A/T for 1750 bp , rendering open the possibility 
for the location scaffold/matrix attached regions (S/MARs; 
(Liebich et al., 2002) upstream of this genes. 

Functional CSharacbexrlzation & Quantification of fi-tubulin pro- 
moters 

Definition of minimal promoter- fragments giving a maximxjm of 
promoter activity was done by ftmctional (juantif ication of pu- 
tative 5' regulatory sequences of Pptxib 1 to 4 in a transient 
expression system, using nonregenerating Physcomitrella patens 
protoplasts as expression system » For each promoter several con- 
structs of different lengths including upstream regions and 5 ' 
UTRs, were brought precisely in front of the startcodon of the 
reporter gene. As reporter gene a human protein (recombinant hu- 
man vascular endothelial growth factor 121 r rhVEGF121; Gorr 
1999) was secreted into the medium via its own signalpeptide. 
The amount of rhVEGF121 in the supernatant of the moss culture 
was quantified by an EDISA and reflected the strength of the 
promoter or promoter fragment in the system. Values were related 
to values obtained by the 35S promoter. Each construct was 
transformed a minimum of six times in two to three different 
transformation experiments. Samples were taken after 24 and 48 
hours / respectively, with 48 hour samples measured twice in ap- 
propriate dilutions in an ELISA. An overview of the results is 
given in Figure 2 . 

The expression promoting regions of Pptxib 1 to 4 are disclosed 
as Seq.lD.Nos. 1 to 8. 

Cloning of axqplif led pramoter fragments of Pptub 1 and 4 into 
pRTlOlnew 
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Pptub 1; 



1- 


■0 


(primer 


364XhoI, 


363cat) 


1- 


•1 


(primer 


219X110I, 


3 63cat) 


1- 


•3 


(primer 


549XlioI, 


3 63cat) 


1- 


■4 


(primer 


226X110I, 


363cat) 


1- 


•5 


(primer 


550Xhol, 


3 63cat) 



Pptub 2: 2-0 (primer 291, 225cat) 

Pptub 3: 3-0 (primer 292, 223cat) 

Pptub 4: 4-0 (primer 373XhoI, 374cat) 

4-1 (primer 548XhoI, 374cat) 



The promoter fragments given above were amplified with Pfu nat- 
ive polymerase (MBI Fermentas, St. Leon-Rot, Germany) on genomic 
DNA using reverse primers starting with the reverse complement 

sequence of the ATG start codon (cat ) and, in part, forward 

primers containing Xhol sites • PGR products were cut Xhol and 
ligated into XhoI/HincII or not cut at all and and ligated into 
Hindi opened pRTlOlnew, respectively. Generated clones were 
verified by sequencing. Clone 1-2 (XhoI/EcoRI) , 2-1 (Bglll) , 2- 
2 (Sail), 2-3 (EcoRI/Sall) , 2-4 (EcoRI/Sall) , 3-2 (Sail), 3-3 
(Ecol47I/HincII) , 3-4 (Xhol/Sall) were generated by internal de- 
letions of longer clones. The remaining vectors were gel-eluted 
and religated- In case single strand overhangs did not fit, lig- 
ation was performed after filling- in of recessed 3 '-termini with 
Klenow Fragment (MBI Fermentas, St. Leon-Rot, Germany) , follow- 
ing the suppliers manual. 



Pptub 1 

Six different promoter lengths were cloned into the transforma- 
tion vector pRTl01p21 in front of the reporter gene. The data of 
all constructs are given in figure 3. (5' UTR = +1 (TSS) xintil 
+226, +227= ATG) 



1-0 -1307 bp (1533 bp 5' region of Pptub 1) 

1-1 - 985 bp (1211 bp 5' region of Pptub 1) 

1-2 - 416 bp (642 bp 5' region of Pptub 1) 

1-3 - 248 bp (474 bp 5' region of Pptub 1) 
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1-4 - 83 bp (309 bp 5' region of Ppt\ib 1) 

1-5 - 71 bp (297 bp 5' region of Pptub 1) 

Promoter fragment 1-2 can be defined as the shortest promoter 
fragment giving high expression rates ♦ The rates are app. 150% 
compared to values generated with the 35S promoter, which was 
set to 100%. Note that upstream of the minimal promoter fragment 
1-2 a long, very A/T rich region starts (over 80% A/T for nearly 
900 bp) . 

Pptub 2 

Five different promoter lengths were cloned into the 
tion vector pRTl01p21 in front of the reporter 
gene. The data of all constructs are given in Figure 
= +1 (TSS) until +122, +123= ATG) 
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Promoter fragment 2-2 can be defined as the shortest promoter 
fragment giving high expression rates. The rates are comparable 
to values generated with the 35S promoter (100%) . 

Ppttib 3 

Different promoter lengths were cloned into the transformation 
vector pRTl01p21 in front of the reporter gene. The data of four 
constructs are given in Figure 5. (5' UTR = +1 (TSS) until +112, 
+113= ATG) 



3-0 -1274 bp (1386 bp 5' region of Pptxxb 3) 

3-2 - 765 bp (879 bp 5' region of Pptub 3) 

3-3 - 272 1:^ (384 bp 5' region of Pptub 3) 

3-4 + 52 bp (60 bp 5' UTR of Pptiab 3) 



Promoter fragment 3-2 can be defined as the shortest promoter 
fragment giving high expression rates. The rates are app* 300% 
compared to values generated with the 35S promoter, which was 



trans forma- 
4. (5' UTR 



set to 100%. 
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Pptub 4 

Two different promoter lengths were cloned into the transforma- 
tion vector pRT101p21 in front of the reporter gene. The data 
are given in Figure 6 . 

(5' UTR = TSSs (+1 imtil +103) until +205, +206= ATG) 

4-0 -419 typ (624 -bp 5' region of Pptxib 4) 

4-1 - 1 bp (206 }y£> 5^ region of Pptiib 4) 

Promoter fragment 4-1 gives escpression rates that are are app. 
250% compared to values generated with the 35S promoter, which 
was set to 100%. Note that upstream of this minimal promoter 
fragment (4-0) a long, very A/T rich region starts (75% A/T for 
1750 bp) . 

In sioinmary transient promoter activity of Pptub 1 to 4 genomic 
upstream regions were characterised. Minimal promoter fragments 
showing a maxiimam of promoter activity were defined and gave 
yields of up to 3 times the 35S promoter activity. 

Pptub-constructs summary (see also: sequence listing) 

Pptvibl upstream 

-1533 until -1 (+1 = start codon) 
-1533 until --644 = 81 % AT 
-1533 VEGF 1-0 (primer 364) 
-1211 VEGF 1-1 (primer 219) 
-642 VEGF 1-2 (EcoRI/XhoI) 
-474 VEGF 1-3 (primer 549) 
-309 VEGF 1-4 (primer 226) 

-297 VEGF 1-5 (primer 550; without putative TATA box: -304 -un- 
til -295) 

-226 TSS (start of 5'DTR) 
Pptubl downstream 

1 until 1539 (1 = directly behind stop codon) 
332 end of longest EST (3 'UTR) 
1539 start of primer 90 
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Fpttib2 upstream 

-1197 until -1 (+1 = start codon) 
-1197 VEGP 2-0 (primer 291) 
-798 VEGF 2-1 (Bglll) 
-547 VEGP 2-2 (Sail) 
-450 until -489 = poly A stretch 
-367 VEGP 2-3 (EcoRI/Sall) 
-189 VEGP 2-4 (Xhol/Sall) 
-122 TSS (start of 5'UTR) 

Fptub2 dovmstream 

1 until 1012 (1 =» directly behind stop codon) 
297 end of longest EST (3'UTR) 
1012 start of primer 206 

Pptub3 upstream 

-1386 until -1 (+1 = start codon) 

-1386 VEGF 3-0 (primer 292) 

-879 VEGF 3-2 (Sail) 

-384 VEGF 3-3 (Ecol47l/HincII) 

-112 TSS (start of 5'UTR) 

-60 VEGP 2-4 (Xhol/Sall) 

Pptub3 downstream 

1 until 997 (1 = directly behind stop codon) 
203 end of longest EST (3'UTR) 
1012 start of primer 204 

Ppttib4 upstream 

-624 lontil -1 (+1 = start codon) 
-624 VEGF 4-1 (primer 373) 
-206 VEGP 4-2 (primer 548) 

-205 until -103 area of TSS (start of 5'UTR) 
-55 until -93 CT stretch 

Pptub4 downstream 

1 xaitil 1146 (1 = directly behind stop codon) 
466 end of longest EST (3'UTR) 
1141 xrntil 1164 Ncol 
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EX2UyiPliK 2z Cloning and analysis of actin genes from different 
moss species and their expression promoting regions. 

2.1.6enomic structure of Physcomitrella patens actin genes. 

Pour actin genes and promoter regions of the moss Physcomitrella 
patens and three from Fimaria hygrometrica and the liverwort 
Marchantia polymorpha have been isolated in order to construct 
expression vectors for their use in moss. 

Using specific oligos designed from Physcomitrella EST sequences 
that are present in the public databases, four actin genes 
(Ppactl, Ppact3, PpactS and Ppact?) were isolated in several 
rounds of iPCR from genomic DNA and sequenced. 

In Physcomitrella the structure of the isolated genes resembles 
in one case (Ppactl) the conserved structural organisation of 
actin genes of higher plants. The un-translated leader is dis- 
rupted by a relatively long (955bp) intron located 14 nt up- 
stream the initiator ATG. The coding region presents three 
smaller introns which are situated at the same positions as the 
introns of actin genes of other plant species. The first one is 
located between codons 20 (lys) and 21 (ala) , the second is 
splitting codon 152 (gly) and the third is between codon 356 
(gin) and 357 (met) . This general structure appear to be differ- 
ent for the three other Physcomitrella actin genes isolated 
(Ppact3, PpactS, and Ppact?) • In those cases the 5'UTR intron 
(434bp, 1006bp and 1055bp respectively) is also located 14nt be- 
fore the ATG but the coding region is disrupted only by one in- 
tron positioned between codons 21 (lys) and 22 (ala) (Fig* 7) . 

2.2 .Activity studies of the expression promoting regions of 
actin genes. 

To study the activity of the different Physcomitrella actin ex- 
pression promoting regions (Seq^lD Nos. 5 to 8) as well as the 
effect of the 5'UTR of the different genes, different vectors 
were designed for expression of the hVEGF protein under the con- 
trol of the 5' regions under study. 
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Aroiind 2Kb genomic regions upstream the transcription initiation 
site were isolated by iPCR from genomic DNA and sequenced, and 
vectors containing the cDNA of the hioman VBGF driven by the pro- 
moters and containing the exact leader sequences including the 
5'intron were constructed for transient transfection of moss 
protoplasts. The complete 5 'promoting expression regions were 
amplified by proof reading PCR using primer 395 and 332 for 
Ppactl, 408 and 333 for PpactS, 511 and 334 for PpactS, and 413 
and 335 for Ppact7. 

Transformation of protoplasts was performed using the same niom- 
ber of molecules for each construct to be tested and in parallel 
to a construct carrying the hVEGP cDNA under the control of the 
CaMV 35S promoter • The hVEGF protein contains at the N- terminal 
part a 26 aa signal peptide that permits secretion of the recom- 
binant protein to the medium. Analysis of the transformations 
was carried out by ELISA, taking different dilutions of the me- 
dium where the protoplasts were incubated 48 hours after trans- 
formation* 

The capacity to drive expression of the different Physcomitrella 
5' actin regions was compared to the activity of the con- 
stitutive 35S promoter. 

In all cases analysed, the 5 'regions of the actin genes were 
reaching higher activity than the 35S promoter. However the 
level of expression varied for the different actin regulatory 
sequences. Thus, the 5 'sequence of Ppact3 was only promoting 
around a 2 fold higher expression of VEGF than the 35S promoter. 
Higher levels of VEGF were measured when vectors containing the 
5 'regions of Ppactl and Ppact7 were used for transformation. In 
those cases values between 4 and 8 folds the 35S values were ob- 
tained. Nevertheless the most dramatic differences were observed 
in the case of the S'PpactS gene, where up to 11 fold higher ex- 
pression values coinpared to the 35S were in some cases obtained 
(Fig. 8) . 

To further investigate on the role of the 5' UTR region of the 
high activity Physcomitrella actin genes, vectors containdng de- 
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letionSr combinations and substitutions of the 5'UTR intron were 
made and used for transient assays in moss protoplasts. 

Deletion of the Ppactl 5 'intron dramatically decreased the 
levels of transient expression in comparison to those obtained 
when the intact 5 ''region of Ppactl was used. In this case the 
amotint of secreted VBGF protein that could be detected in the 
protoplasts medium was very similar to the obtained by the CaMV 
35S promoter. This would indicate that the 5 'intron of the 
Ppactl is essential for efficient gene expression from the 
Ppactl promoter. Same results were obtained when the 5'UTR in- 
cluding the leader intron was fxised downstream the 35S promoter 
This construct yielded the same amount of secreted protein as 
the intact 35S promoter indicating that the 5 'UTR region is not 
having any dramatic influence on the activity of promoters other 
than the Ppactl promoter. It is important to indicate that a 
construct carrying just the 5 'UTR Ppactl region was able to pro- 
mote protein production only in a 30% lower amoxjoit than the 35S 
promoter alone. This could suggest a small promoter activity in 
this region of the gene, or a rest of promoter activity present 
in the backbone sequence of the vector (Fig. 9) . 

The same approach was used to investigate the influence on the 
promoter activities of the 5'UTR introns contained in the PpactS 
and Ppact? genes. Constructs in which the 5' intron was deleted 
were analysed and similar results as in the case of Ppactl were 
obtained, ie. the amotant of protein reached was approximately 
the same as with the 35S promoter in the case of PpactS aind 
slightly lower in the case of Ppact? , indicating that the pres- 
ence of the intron in the 5'UTR is essential for the efficient 
activity of the promoters. Again some residual promoting activ- 
ity was observed when the transformation was performed with con- 
structs containing only the 5 'transcribed region up to the ATG. 
Furthermore, in the case of these two genes, the fusion of the 
5'UTR downstream the 35S promoter yielded higher rates (2 to 7 
folds) of expression of the VEGF protein when compared to the 
35S promoter alone (Fig. 10, 11) . Similar results were observed 
in the case of Ppact3, where the 5'UTR alone or fused downstream 
the CaMV 35S, yielded around 2 eind 3 folds respectively in com- 
parison to the 35S (Fig. 12) . These indications would suggest 
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the presence of enhancer activity in the 5 'transcribed regions 
for these three genes even when they are positioned under a dif- 
ferent promoter. 

To further investigate the role of the 5'intron present in the 
Ppactl, Ppact5 and Ppact? genes, substitutions of the leader in- 
tron of the Ppact 1 gene with the 5'intron of Ppact 5 and Ppact? 
were engineered in vectors for transient transformation. In par- 
allel substitutions of the Ppactl 5'intron with the ppactl in-- 
trons present in the coding region of the gene, were performed. 

Substitutions of the Ppactl 5'intron, by the Ppact 1 coding re- 
gion introns 1 and 3 resulted in a decrease of the expression 
levels of around 25%. Still the amoxant of protein detected was 
around 2-3 fold higher than the obtained with the CaW 35S pro- 
moter. The substitution of the 5'intron by the intron 2 of the 
coding region surprisingly resulted in no activity of the pro- 
moter {Fig. 13). The construct was however checked, and the se- 
quence showed that the splicing site for the intron was not 
correct. A new construct carrying the correct splicing sequence 
was made and the results after moss transformation indicated 
that the effect of the intron 2 is the same as for the other 
siibsti tut ions . 

A reduction of protein esqpression was also observed when the 
substitution was done with the 5 'introns corresponding to the 
PpactS and Ppact? genes, but in this case the reduction was 
slightly smaller. 

2*3 . Deletion constructs of the expression promoting regions of 
actin genes* 

A further characterisation of the different actin genes pro- 
moters was carried out by meiking deletion constructs of the 
5 'untranscribed regions and analysing them through transient 
transformation of moss protoplasts. 

Thus for the Ppactl constructs carrying different genomic region 
lengths (-1823bp, -992bp, -790bp, -.569bp, -383bp, -23?bp, and 
-82bp) upstream the initiation of transcription (+1) were made. 
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In principle all the constructs except the -82bp, could have 
full promoter activity. However the -383bp construct shows a 
reduction of activity and reaches similar levels as the "82bp 
construct (Fig .14) . 

Analysis of deletion constructs of the promoter region of Ppact3 
revealed some interesting features. As it was described, this 
promoter pres^ted a lower activity coirqpared to the other actin 
genes promoters, although in relation to the CaMV 35S/ it was 
slightly more active^ in this case the following 5 'un transcribed 
regions were tested: -2210bp, -995bp, -821bp, --523bp, -323bp, 
-182bp and -81bp. Siirprisingly the activity of the promoter was 
approximately the same as the CaMV 35S for the constructs con- 
taining up to -821bp of the promoter region. However the con- 
structs containing from bp -523 and shorter regions towards the 
transcription start, yielded two folds more amoxint of recombin- 
ant protein. This could indicate cis-acting regions located up- 
stream the -523bp region that down regulate the transcription of 
this gene during the transient transformation assay (Pig. 15) . 

In the case of PpactS, constructs containing the -1872bp, 
-758bp, -544bp, -355bp, and -121bp fragments upstream the tran- 
scription start of the gene were generated. The results obtained 
from the transient assays indicate that the full activity of the 
promoter resides in a region between -758 and -121 from the 
start of transcription (+1) (Fig. 16) . 

The following deletion constructs for the 5'untranscribed region 
of Ppact7 were analysed: -1790bp, -1070bp, -854bp, -659bp, 
-484bp, -299bp, and -66bp. The results obtained indicate that 
the region coii5)rised in between -484bp and -299bp is essential 
for the full activity of the promoter during the transient ex- 
periment assays. (Pig. 17) . 

In order to obtain a set of heterologous promoters of the Phy- 
scomitrella actin genes, other two species, the moss Funaria hy- 
grometrica and the liverwort Marchantia polymorpha, were used to 
isolate genomic DNA fragments containing actin genes. To this 
end, oligos with different degrees of degeneration were designed 
to perform PGR reactions using as teittplate genomic DNA isolated 
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from the two species. 

2*4.ConiEparison of different actin genes from the different moss 
species Physcomitrella patens, Funaria bargrometrica and 
stteLTcshantia polymorpha 

Physcomitrella patens 

The four different genomic actin sequences isolated from Phy- 
scomitrella patens are likely to represent the whole functional 
sequences of the genes including 5 'promoter sequence, 5 'UTR + 
5'intron, ORF + internal introns and the 3 'UTR and further 3'- 
downstream sequence. In total for Ppactl 5809 bp, for Ppact3 
5633 bp, for Ppact5 8653 bp and for PpactT 6351 bp of genomic 
secjuence was isolated (Fig. 18 A) • The coding regions of the 
isolated Physcomitrella actin cDNAs are almost all 1137 bp in 
length, except Ppactl which has an ORF of 1134 bp. The corres- 
ponding proteins are 378 amino acids in lengths except Ppactl 
which has 377 amino acids. On the nucleotide level the coding 
sequences share homologies between 86.6 and 98.9 %. The protein 
sequences have an identity between 97.1 and 99.7 % (DNA STAR, 
MegAlign Program, Clustal V (weighted) sequence alignment) . 

For all four Physcomitrella actin genes extended genomic DNA se- 
quences 5 'of the ATG Start codon could be isolated by iPCR and 
sequenced: 2973 nt for Ppactl, 3091 nt for Ppact3, 3095 nt for 
Ppact5 and 3069 nt for Ppact7. For Ppactl/ Ppact5 and.Ppact7 
5 'race by using the Gene Racer Kit (Invitrogen) , which allows 
the amplification of only full length cDNAs, was performed to 
determine the 5 'UTRs of the genes . For Ppact3 the 5 'UTR was de- 
termined by the length of different ESTs from database. By com- 
paring the CDNAs with the genomic iPCR fragments the presence of 
large 5 'introns could be shown. The lengths of the 5 'introns 
which are all located at position -14 to the ATG Start codon are 
955 bp, 434 bp, 1006 tqp and 1055 bp for Ppactl, Ppact3, Ppact5 
and Ppact7 respectively (Fig. 18 A) . The positions of the ORF 
internal introns was determined by comparing the genomic se- 
quences and the derived protein sequences to the cDNA sequences 
and protein sequences of the actin genes from Arabidopsis thali- 
ana* The 5 'promoter sequences for the Physcomitrella actin genes 
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available are 1824 nt for Ppactl, 2270 nt for Ppact3/ 1909 bp 
for PpactS and 1805 bp for PpactV (Fig. 18 A) • 

In total 4 different actin genes from Fxxnaria hygrometrica 
(expression promoting regions: Seq,ID Nos. 9 to 12) and 3 dif- 
ferent genes from Marchantia polymorpha (expression promoting 
regions: Seq.ID Nos. 13 to 15) could be identified by degener- 
ated PGR on genomic DNA. As the aim was predominantly to isolate 
5 'promoter regions of the putative different actin gene homologs 
from the different moss species, most of the sequences are in- 
complete at the 3 'end to date (Pig- .18 B/C) . 

Funaria hygrometrica 

For Funaria the identified actin genes were named Fhactl, 
Fhact4.4, Fhact5 and FhactSb. 3951 bp of Fhactl, 2417 bp of 
Fhact4.4, 4432 bp of Fhact5 and 722 bp of FhactSb of genomic se- 
quence could be isolated by iPCR for the different actin genes. 
The complete coding cDNA sequence could be isolated for the 
Fhactl gene which has a coding sequence of 1134 nucleotides • For 
the other Funaria actin genes partial sequences are available at 
the moment, laclcing the 3 'ends: 906 bp for Fhact4.4, 965 bp for 
FhactS and 722 bp for Fhact5b (Fig. 18 B) The isolated coding 
sequences share homologies in a range of 87.4 and 99.2% on the 
nucleotide level. The derived protein secpaences are 90.8 to 99.2 
% identical (DNA STAR, MegAlign Program, Clustal V (weighted) 
sequence alignment) . 

Except for Fhact5b, 5 'sequences upstream of the ATG Start codon 
could be isolated by iPCR and sequenced. Xn the case of Fhactl 
1824 bp, for Fhact4.4 1333 bp and for Fhact5 3289 bp are avail- 
able. The length of the different 5'UTRs were determined by 
5 'race using the Gene Racer Kit (Invitrogen) . The intron-exon 
structure was determined by comparison of the cDNA sequence 
with the genomic sequences obtained by iPCR and by comparison to 
the Physcomitrella genes. As in the case of the Physcomitrella 
actin genes the identified Ftinaria actin genes contain large 
5'introns located at position -14 of the cDNAs, 928 bp, 1015 bp 
and 656 bp in length for Fhactl, Fhact4.4 and FhactS respect- 
ively. By now for Fhactl 700 bp, 145 bp for Fhact4.4 and for 
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PhactS 2515 bp . of 5 'promoter sequence was isolated and se- 
quenced. For Fhactl 419 bp of the 3 'region was isolated. The 
5 'regions or 3 'regions of the Funaria act in genes are amplified 
by PGR on genomic DNA from Fxmaria hygrometrica by using the 
primers 908 and 909 for the 5' region of Fhactl, 983 and 984 for 
the 3' region of Fhactl, 1000 and 1001 for the 5 'region of 
Fhact4.4 and 611 and 612 for the 5 'region of FhactS. 

Marchantia polymorpha 

For Marchantia the identified actin genes were named Mpactl, 
Mpact4 and MpactlS* For all three seq[uences the 3 'ends are lack- 
ing. So far for Mpactl 2229 bp, for Mpact4 3987 bp and for 
MpactlS 2174 bp of genomic sequences were isolated and se- 
quenced. The lengths of the coding cDNA sequences isolated are 
997 nt, 962 nt and 995 nt for lft>actl, Mpact4 and MpactlS re- 
spectively. (Fig. 18 C) . The sequence homologies within the 
Marchantia actin genes are a little bit lower than compared to 
the other two moss species, in a range between 78.3 and 85.5 % 
on the nucleotide level and between 94.7 and 96.1 % on the amino 
acid level (DNA STAR, MegAlign Program, Clustal V (weighted) se- 
quence alignment) . 5'upsteam secjuence of the ATG for all the 
three identified different Marchantia actin genes were isolated 
by iPCR and sequenced; 937 bp for Mpactl, 3025 bp for Mpact4 and 
910 bp for MpactlS. The 5 'regions of the the Marchantia actin 
gene homologous are amplified by PCR on genoiaic DNA from 
Marchantia polymorpha using the primer 950 and 951 for 5 "Mpactl, 
960 and 961 for Mpact4 and 970 and 971 for MpactlS. The intron- 
exon staructure of the ORF was obtained by comparing the differ- 
ent actin gene sequences from the different moss species . The 
isolated 5 ' sequence of Mpactl shows the consensus secgpience for 
intron splice sites (aggt) at position -14 indicating the pres- 
ence of a 5 'intron as in the case of the other Physcomitrella 
and Funaria genes. Within the 5 'upstream sequences of Mpact4 and 
MpactlS no intron splice site consensus sequence is present, 
proposing the lack of 5'introns (fig. 18 C). 

Comparison of of P, patens, F. hygrometrica and M. polymorpha 
actin genes 
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As mentioned above in general the homologies of nucleotide and 
protein sequences for the different isolated actin genes within 
one species is very high especially at the protein level . The 
homologies between the closely related moss species Phy~ 
scomitrella patens and Punaria hygrometrica also appear to be 
very high. On the nucleotide level the actin genes show homolo- 
gies between 86.9 and 96.3 % identity and on the amino acid 
level the range of homology is 95.5 to 99.7 %. 

In contrast to that the more distant relation of the liverwort 
Marchantia polymorpha to the other both species is reflected in 
the lower homologies of the genes on the nucleotide level. The 
homologies between Physcomitrella and Marchantia actin genes is 
in the range of only 75.2 % and 78.8 % and between Fionaria and 
Marchantia the homologies are in the range of 75.5 % to 80.4 %. 
On the amino acid level the homologies of the Marchantia actin 
genes vary between 93.0 % and 96.1 % compared to Physcomitrella 
and between 93.4 % and 96.7 % conipared to Funaria. 

Intron-exon structxire (Fig. 18 A/B/C) 

As indicated before the intron-exon structure of the Phy- 
scomitrella actin genes to a certain extent are similar to that 
of higher plants but also with clear differences. All isolated 
Physcomitrella actin genes contain a large 5'intron in the 5 'un- 
translated region, which almost all of the investigated higher 
plcints actins do. Only Ppactl contains 3 internal introns within 
the ORF reflecting the situation for example for all isolated 
actin genes from Arabidopsis thaliana. The ORF internal intron 
positions of Ppactl are also consearved compared to higher plant 
actin genes. On the contrary Ppact3, Ppact5 and Ppact7 contain 
only one internal intron within the ORF* 

The same genomic structtire can be found in the isolated Funaria 
actin genes with one extended 5 'intron within the 5'UTR. Fhactl 
has the same conserved intron-exon structure as Ppactl whereas 
Fhact4.4 and FhactS contain only one intennaal intron within the 
ORF sequence. The isolated secjuence of FhactSb is to short to 
say something clear about the intron-exon structure but at least 
it does not contain the internal intron2 compared to Fhactl or 
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ppactl . 

In Marchantia the genomic structiires of the isolated actin genes 
SB&a to be more different. It is important though, to indicate 
that the number of different actin genes in the three different 
moss species is not knowi and it could be that the three isol- 
ated actin genes from Marchantia do not represent the individual 
fuQCtionally homologous genes* It is likely that there are more 
than three actin genes present in Marchantia and more than four 
actin genes in Physcomitrella and Funaria. 

However, the intron-exon structure of Mpactl seems to be the 
same as in the case of Ppactl and Fhactl with a 5'intron within 
the 5'UTR and the coiaserved positions of the ORE internal in- 
trons 1 and 2. B^actlS also contains the conserved ORF internal 
intronl and intron2 but it does not have a conserved intron 
splice site at position -14 within the 5'UTR or at position -10 
as found for the Physcomitrella or for some Arabidopsis actin 
5'introns respectively, arguing for a lack of a 5 ^intron. The 
same situation is found for 3y[pact4, probably lacking a 5 'intron. 
In addition Mpact4 also does not have the intronl or the intron2 
within the ORF, which is different from all isolated moss actin 
genes so far. 

Putative homologous moss actin genes 

Although the intron-exon structixre of the different isolated 
actin genes from Physcomitrella and Fvmaria might propose con- 
clusions about homologous genes between the two species one can 
not conclude this from the genomic structure. For example Ppactl 
and Fhactl share the same conserved intron-exon structure but it 
is not clear, as indicated before, whether there are more genes 
present in the genome of both plants which might have the same 
genomic structures. To give a statement on homologous genes also 
expression data would be required to propose fianctional homolo- 
gies. Also from the sequence homologies of the proteins or the 
coding cDNA sequences it is not possible to make any assumptions 
about corresponding homologous genes between the species as they 
are too similar in general. 
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But in the case of Physcomitrella and Fixnaria it was interesting 
to find also very high sequence homologies within the non coding 
sequences regarding to the UTR sequences, intron sequences and 
promoter sequences. Therefore high homologies were found between 
Ppactl and Fhactl and between Ppact3 and FhactS. In both cases 
the intron sequences showed iinusual high conservation. In the 
case of Ppactl and Fhactl the hoitiologies were as follows: 5 'in- 
tron: 58 %; intronl: 64 %, intron2 : 52 % and intronS ; 55 %. In 
the case of Ppact3 and Fhact5 the homologies are for the 5 'in- 
tron 51 % cuid intronl shows 48 % identity. 

For both cases also the isolated 5 ' promoter sequences show high 
homologies • Fig, 19 A shows a schematic conqparison of the isol- 
ated promoter regions of Ppactl and Fhactl. The transcription 
start is said to be at position 1, the first nt of the 5 'pro- 
moter region is said to be -1. The isolated 267 bp of 5 'promoter 
region of Fhactl show an over all homology to the first 267 bp 
of the Ppactl 5 'promoter region of 58 %. Within this sequence 
there are blocks of different homologies observable. The se- 
quence between -267 and -129 shows a homology of 51 %. The fol- 
lowing 29 bp show 62 % identity and within position -100 and -1 
the homology is almost 70 %. Concerning these high sequence 
identities between the Ppactl and Fhactl intron and promoter se- 
quences it is reasonable to put these two genes as the homolog- 
ous genes in these two mosses. Another interesting aspect is the 
observation of the drop of expression obseinred between the dif- 
ferent Ppactl :vegf deletion constructs (Fig. 15) . The dramatic 
drop of expression appears to be between the -237 and the -82 
deletion construct. This ctrgues for an important function of the 
5 'promoter region between -129 and -1 as here the sequence of 
the promoter regions of Ppactl and Fhactl is highly conserved as 
just mentioned and the -82 deletion constiruct does not contain 
all of the highly consearved sequence but the -237 deletion con- 
struct does. 

Highly conserved regions within the promoters of Ppact3 and 
FhactS can also be observed. In this case the promoter regions 
for both genes isolated are much longer. Therefore even more re- 
gions of homologies are found between the two 5 'promoter regions 
(Fig. 19 B) . In this case the promoter regions of Ppact3 from -1 
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to -2270 and of FhactS from -64 to -2325 show some interesting 
homology featxzres. The difference in the TS position might be 
due to the fact that the 5'UTR of FhactB was determined experi- 
mentally and the one of Ppact3 was determined by analysing ESTs 
from database. 

The sequence of PpactS between -2270 and -1876 shows only a 29 % 
low homology to the same sequence area of FhactS located between 
-2325 and -1948. Then an expanded region of about 1100 nt is 
following showing a very high homology of 82 %. The next 140 nt 
of Ppact3 and 152 nt of FhactB promoter show ''only^ 53 % homo- 
logy. The sequence of Ppact3 located between -641 and -463 shows 
again high conservation of 76 % to the region between -705 and 
-528 of Fhact5. The following about 180 nt show again lower ho- 
mology of 53 %• The last 288 bp of Ppact3 promoter sequence then 
are again more homolog with 73 % to the next 280 bp of Fhact5* 
These regions of different degrees of homologies between the two 
homologous genes might indicate the presence of regulative act- 
ive elements within the 5 'promoter region. 

As for the case of Ppactl and Fhactl also here the expression 
analysis of the different Ppact3:vegf deletion constructs are 
interesting in this context (Fig. 17) . Here a significant in- 
crease of the vegf expression level of the -2210, -995, -821 de- 
letion constructs compared to the -523 deletion construct was 
observed. The three deletion construct which contain at least 
parts of the eaqpanded homolog region between -1876 and -779 
found in Ppact3 and Fhact5 reached levels about that of the 35 S 
promoter whereas the -523 deletion construct showed a 2 % fold 
increase of expression compared to the 35S promoter or the 
longer deletion constructs. This might argue for the presence of 
a negative regulator within this region of 82 % homology between 
Ppact3 cind Fhact5. 

In the case of Mfeirchantia, no comparable sequence homologies 
could be found between the different actin genes from Phy- 
scomitrella and Funaria. 



For the Fhact5 gene a construct containing 1157bp of the 5'un- 
transcribed region fused to the hVEGF cDNA was made and used for 
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transient transformation experiments on Physcomitrella proto- 
plasts. The amovmt of protein detected in this case was in the 
same range but slightly higher (up to 2 folds) as with the CaW 
35S promoter. The FhactS gene presents the highest homology to 
the PpActS gene, and interestingly both of the promoters showed 
a similar activity in Physcomitrella protoplasts during the 
transient assays. 

2.5 .Stable transgenio lines* 

The cassettes containing Ppactl, PpactS and Ppact? 5' MEPRs 
driving the expression of the VEGP cDNA were introduced in the 
genome of Physcomitrella plants. For each of the MEPRs five to 
ten stably transformed plants were recovered and tested for the 
expression of rhVEGF, For these three MEPRs tested, expressed 
and secreted moss derived rhVEGF was detected in the super- 
natants of the cultures where the plants were growing (standard 
Kiiop mediiHn) , indicating that the MEPRs promote protein expres- 
sion imder non- inducing conditions (standard conditions) when 
they are integrated in other parts of the genome • The amount of 
protein that could be measured in those lines ranged from 7ngVE- 
GF/mg moss dry weight until 53ngVEGF/mg moss dry weight, depend- 
ing on the construct and the stable line. 

One transgenic moss strain containing VEGF cDNA under control of 
PpactS was used to perform bioreactor cultures. The amount of 
moss derived recombinant VEGF in the supernatant of bioreactor 
cultxires measured by ELISA was 40-50ngVEGF/mg moss dry weight. 

EX2UIIPI1E 3 s Cloning and analysis o£ Physcomitrella parens and 
Funarla lqr9roiiie1:rlca ublqultln genes and tbelr expression pro- 
noting regions. 

Taking advantage of the presence of several EST sequences cor- 
responding to polyubiquitin genes of Physcomitrella, specific 
oligos were designed to isolate the corresponding genomic se- 
quences of the most abundantly present EST of the xxbiquitin gene 
homologous sequence in the databases, named Ppubql, 214 6bp of 
5 'region of Ppubql could be identified by iPCR • A 129bp tran- 
scribed 5 'leader is present before the ORF starts, determined by 
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5 'race. lOxe 5 "region of Ppiibql is amplified by PGR on genomic 
DNA from Physcomitrella patens using the primers 777 and 602. 

Vectors carrying different parts of promoter and 5'UTR region 
driving expression of the hVEGP cDNA, were consttncted to ana- 
lyse the activity of the the promoter dTJoring transient trans- 
formation of Physcomitrella protoplasts. 

The results indicated a similar activity for this promoter to 
the Ppact5 promoter (or even higher) . The constructs tested, 
1.6Kb and 1.3Kb promoter fragments, reached expression levels 
around 4 times and almost 7 times higher than the CaMV 35S. 

The libiquitin gene from Funaria, Fhubql, was identified by per- 
forming a 5' race PGR on Punaria total RNA with a primer derived 
from the Ppubql coding sequence. The isolated 5'UTR sequence and 
partial coding sequence was used to design primers for iPGR on 
genomic ligations of Funaria hygrometrica. This way 5' upstream 
sequence of the 5'UTR was identified. The 5 'region is amplified 
by PGR on genomic DNA from Funaria hygrometrica using the 
primers 943 and 944. 

EXZ^MPZiE 4 s Cloning and analysis of Physcomitrella patens RBCS 
e3spresslon prcgncioting regions. 

As putative candidates next to the actin, tubulin and vibiqaitin 
genes the ribulose-1, 5- bisphospate carboxylase/ oxygenase small 
STxbunit (rbcS) genes were taken into consideration. The differ- 
ent rbcS genes are encoded on the nuclear genome. The rbcS genes 
are members of a gene family. The rbcS genes are expressed ba- 
sically in all green parts of plants able to fixate GO2. There- 
fore this gene family is of interest to get 5 'and .3' flanking 
expression promoting regions of different rbcS genes from dif- 
ferent mosses. As a first step Physcomitrella EST databases were 
analysed. It was found that the rbcS genes from Physcomitrella 
patens are organised in a gene family^ consisting of 12 genes. 
The most abundantly present ESTs of the rbcS genes, named Pprbc- 
S12, was taken as a candidate to find it's 5' and 3' expression 
promoting sequences. Starting with the EST sequence data, 5' and 
3' flanking regions of this gene was identified by iPGR and the 
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cloned 5 •and 3' regions were sequenced. The 5 'region is ampli- 
fied by PGR on genomic DNA from Physcomitrella patens using the 
primers 839 and 858* The 3 'region is aitqplified by PGR using the 
primers 904 and 901. 

Xn the enclosed Se^ence lii sting, the following sequences az^e 
given ( Seg .TP. No/name of sequence/ 5* or 3' region relative to 
the protein encoding region) s 
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Table 1: List of primers 

Ho. sec[aence|5'-3') 

35 AmGGAGATGmGGCG 

36 OmOGCTGmTRGTYCC 

38 mmmmcmm 

40 G6B&T66imG&TG6AGmC 

67 AQtmmQOmTRCGCTmmftTTC 

69 GTCICTAGACGACAAGACCGGGQAmCAGC 

70 T(aGTGCTCm!imTCTC!KTO 

71 (mmmmQkismmkQmm!S 

89 mimmmmkommmi 

90 !I:G(X:IGCTCM(^6!MG8(^^ 

91 cmimkwawmmmis 

95 GA(XmAAACCII!GQmiCACGCAm 

113 mS^mAIISUUiGATGIK^ 

149 CTCACCAGCCAATGGCTATGC 

203 aMmcmGTTGTCmCIPPC 

204 GATCGAAATT(mfim:CTCCAC 

205 mmkmmmQimsM. 

206 Mmmmxmmim 

219 mm!smmosmimmiA'mi£ 

223 QTCT!im?rAAG6ATGA!01Ca&(m 

225 m:!mmmmmMS 

226 CEACmtXaaTAIMTACTGCCCTAGTGCC 

291 GA(aGATCmmGTCGA(MGG(ra3GAO(3!l!& 

292 G&C(X!GTGGGACTiaCITT6Tm^ 

296 QWlXmQSISk'Pm^ 

297 (aiTCCCMX3TT(XTKa^ 

298 GITTTGTGGCTCn^TTGG 

299 ATOBCTTCmOICTTCTTO 

300 GmCGCTCGCAATGCGiaCT 

320 AACT!I°ICTOTGT(^^ 

321 6ACCm(mcm6CTOAATCATGGTCATAG 

332 QTTTOTAATACCGArammCCA 

333 CA!mGAA6AAAMmCATCAAAA6 

334 (mkfmwmmmMm 

335 (WTTmGAATGATCCIIAC&GQlGTlX: 

336 ASATCTQGCAAeiTCCCinCG 

337 6BAGA6y&6AAGGe»m&T6 

338 G6AAGAA6A6T0GAG&ASCGAT 



Ho. sequence(5'-3') 

363 aTCTOTCCAACTACCGCGACCCGAACCC 

364 mmmsiimiammspmmc 

373 (mmmmmmmmmae 

374 (mmmtsfxmxmsc 

395 ATCmG6&T(XmAACGGAGGATAA6T 

408 CAACmGA!imOTTAAGaX:mTTO 

413 ATTTCm(MG!IMATCATGTTAATT(mATGGT 

511 Tm03A6ACimCTAATTGACAAei!AT6 

547 (mmfmwpixmm 

548 m!m%GTmX!GC!^^ 

549 G!l!G(mai^^ 

550 MM^mmxssmmixmk'K 

602 CA1CCTTACAG6IUmCK6 

611 AmTGGOAAACATOCCCTG 

612 CATCGAGA-RJAAATGrrCTG 

777 mCTOGAGAIEACAAGaGmTAAATCATATAC 

839 ATATCmGAmTGrnGATAAmmmGA 

858 (mmim:!mx!iiaa(m!c 

901 iwmm(mMmmim'!msc 

904 KmmocGOimaMmimMx^^ 

908 (nTPCGTGTOOm 

909 (aTTTCTTAATACGGACCTGCC 

943 AmTCmGGAATT(MTOaTTAA(3GAGAA'EATGAC 

944 CATCTOCAAOSCTmTCAOTC 

950 CATATGCGTAGGGAGfimG 

951 mmmmxms: 

960 TCm'ieiTAAGCGmiCA 

961 GTISUmAGQ&GSrGTCX^ 

970 AAGCmGCAAGCAGCTCKmC 

971 ATCGACGATAGACTGCAAGCC 

983 AGGaGTGWACACATCTTTTAC 

984 (36CTAAGAC6ACGCATTCTGT6 

1000 G6ATCCGAGAG6AAAG&GA6A6 

1001 (XXITACAATGATCCTGCATAG 

lOR vmm:!mamsfm 

8P CGGJAOmAGGGCCTC^ 

9F m2mimGG6(EA06TCT 

F7 mmitEmmcmiMx 
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Claims : 

1. Isolated nucleic acid molecule encoding a moss expression 
promoting region (MEPR) • 

2. Isolated nucleic acid molecule according to claim 1, charac- 
terised in that the MEPR is selected from MEPRs of Phy- 
scomitrella, Pimaria, Sphagnum, Ceratodon, Marchantia and 
Sphaerocarpos , preferably of Physcomitrella patens, Funaria hy- 
grometrica and Marchantia polymorpha. 

3. Isolated nucleic acid molecule according to claim 1 or 2, 
characterised in that the MEPR is selected from the Seg;. ID Nos- 
1 to 27 or expression promoting fragments thereof. 

4. Isolated nucleic acid molecules according to any one of 
claims 1 to 3, characterised in that it conqprises a moss pro- 
moter and preferably a 5'-UTR region and/ or a 5'-intron and/ or a 
3'-UTR 

5. Isolated nucleic acid molecules according to any one of 
claims 1 to 4, characterised in that the MEPR has an expression 
promoting activity being at least equal to the expression pro- 
moting activity of cauliflower mosaic virus (CaMV) 35S promoter. 

6. Isolated nucleic acid molecules according to any one of 
claims 1 to 5, characterised in that the MEPR has an expression 
promoting activity being at least 200 %, preferably being at 
least 500 %, especially being at least 1000 %, of the expression 
promoting activity of cauliflower mosaic virus (CaMV) 35S pro- 
moter . 

7. Isolated nucleic acid molecules according to any one of 
claims 1 to 6, characterised in that it further comprises a cod- 
ing region for a recombinant polypeptide product, said coding 
region being under the control of the MEPR. 

8. Isolated nucleic acid molecules according to any one of 
claims 1 to 7, characterised in that it further comprises a se- 
lection marker. 
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9. Isolated nucleic acid molecules according to any one of 
claims 1 to 7^ characterised in that it ftirther comprises se- 
quences which are homologous to genomic sequences of the species 
to be transformed thereby allowing targeted integration in this 
species • 

10* Isolated nucleic acid molecules according to any one of 
claims 1 to 5^ characterised in that is provided as an antisense 
or ribozyme molecule 

11. Process for the expression of a recombinant polypeptide 
product in an etikaxyotic host cell comprising the following 
stQps : 

- providing a recombinant DNA cloning vehicle comprising an isol- 
ated nucleic acid molecule encoding an MEPR according to any 
one of claims 1 to 10 and optionally a coding region for said 
recombinant polypeptide product, said coding sequence being un- 
der the control of the MEPR of said nucleic acid molecule in 
said host, 

- transforming said eukaryotic host cell which does not naturally 
. harbour said coding sequence in a way that it is under the con- 
trol of said MEPR, 

- culturing the transformed eukaryotic host cell in a suitable 
culture mediiua, 

- allowing expression of said recombinant polypeptide and 

- isolating the expressed recombinant polypeptide, 

12. Method according to claim 11, characterised in that said eu- 
karyotic host cell is selected from plant cells, preferably moss 
cells, especially Physcomitrella patens cells. 

13. Method according to claim 11 or 12, characterised in that 
said host cell is a protonema moss tissue cell. 

14. Method according to any one of claims 11 to 13, character- 
ised in that the culture medium is free from added phytohor- 
mones thereof . 
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15. Method according to any one of claims 11 to 14, ctiarac- 
terised in that the cell is selected from moss cells of the 
group Physcomitrella, Fiinaria, Sphagnijm, Ceratodon, Marchantia 
and Sphaerocaarpos • 

16. Method according to any one of claims 11 to 15, character- 
ised in that the host cell expresses said recombinant poly- 
peptide product transiently. 

17. Use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for industrially produ- 
cing a polypeptide, especially for providing recombinant cells 
producing said polypeptide. 

18. Use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for expression of a moss 
polypeptide, the expression of said moss polypeptide being not 
naturally controlled by said MEPR, especially for providing re- 
combinant moss cells expressing said polypeptide. 

19. use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for screening and defin- 
ing consensus segviences for expression promoting regions. 

20. Use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for recombinant expres- 
sion of postranslationally modifying proteins, especially for 
the production of posttranslationally modified proteins. 

21. Use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for in vitro expression 
of recombinant proteins. 

22. Use of an isolated nucleic acid molecule encoding an MEPR 
according to any one of claims 1 to 10 for recombinant expres- 
sion of metabolism modifying proteins. 
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Siuranary: 



Moss Eacpression Promoting Regions 



Disclosed are isolated nucleic acid molecules encoding moss ex- 
pression promoting regions (MEPRs) as well as a method for pro- 
ducing recombinant polypeptides using such MEPRs. 
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Fig. 14: Ppacti promotenvegf deletion constructs. 

1.5'Ppact1-1821:vegf 




2. 5Ppact1-992:vi^ 



polyA pof/A 



Are 




3. SPpact1-7^v^ 




4. 5Ppact1-Sra[v^ 



I |_ 



I 



5. 5Ppact1-383v€gf 




6. SPpact1-237:vegf 



7. 5T^dt1-82yegf 



8* SllTRivegf 



SIfttlllll 




/TO 
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Fig. 15: PpactS promoter: vegf deletion constructs. 

1.5'Ppact3-2208:vegf 



•^08 bp promoter 



2. 5*Ppact3-992:vegf 
3.6'Ppact3-821:vegf 

4. ffPpact3-523:vegf 

5. 5'Ppact3-323:v6gf 

6. 5'Ppact3-182:vegf 

7. 5'Ppact3-81:vegf 

8. ffUTR:vegf 



Slittron 



'S^fofiroh 



S*lnnnpn 





ATG 




ATS 




5*intr6nt 
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Fig. 16: PpactS promotenvegf deletion constructs. 



1.5"Ppacl5-1875:vegf 



2. 5' PpactS -758:vegf 



AT6 



3. 5' PpactS -544:vegf 



ATG 



4- S' PpactS -355:vegf 



ATG 



ATO 



5. S' PpactS -121 :vegf 

6. S' PpactS -121:vGgf 



l^inlrop 



ATG 



7. 5' PpactS promoter onlyrvegf 

200000 
180000 
160000 
^ 140000 
% 120000- 
U 100000- 
> 80000 
60000 
40000- 
20000- 
0 



758 bp'promotdr PpactS 




6 7 
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Fig. 17: Ppact7 promoter:vegf deletion constructs. 

1.5'Ppact7-1790:vGgf 



ITdOlbp promoter 



2. 5' Ppacf7 -1070:vefl fl ^ : ::;;j?>y#;Rg^*y 



3. 5' PpactZ -854:vegf 



4. 5* PpactZ -659:vegf 



5. 5' PpacfZ -484n^egf 



6.S9.|>p.|>rom6tjbF 



5'fntron 





ATG 








ATO 












ATG 


i *. . . . snntrpn 





6. 5' Ppactr -299:vegf 

7. 5' PpactZ -66:vegf 
8. 5' UTRactZ:vegf 



299 bp 



5%itron 





S^ntron. 








S'Intron 






■ Probe 1 

■ Probe 2 
□Probe 3 
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SEQUENCE LISTING 
<110> greenovation Biotech QmbH 
<120> Bryophyte expression promoting regions 
<130> R 42095 
<160> 27 

<170> Patentin version 3.1 

<210> 1 
<211> 1533 
<212> DNA 

<213> Physcomitrella patens 
<400> 1 

tagcataaga taaagatgtt ctctacctaa tttattttta tttatcacta ataactcata 60 

tcaatctaaa atatataaat gcctttaaca atagaagaat atgattcaac aaacccaatt 120 

ctatcattaa aaatatatct aagattagat atgataaaaa tagataataa tattaataaa 180 

tcattttaag gttgtaatgc aactataata atttttaata ttataacttt ttagtttttt 240 

aaaataaaaa taaaatgtta aaatattata aaataattat actttatata tttatgatca 300 

agttagtaca ttgatacatt taaagtccaa aataatttaa tgataccaac ttgcaaaaaa 360 

tttaatatta ttaaaatatt ttaaaaagtt aagagcaaga aaaattattc taaatagaat 420 

tcataccatg gtattataaa gatacaaaga atcaatgtgt atttatttat tttacataca 480 

ttacttgcaa tatatggttt atactacaaa tgactatata ttgaagatac taaccacaaa 540 

aataaaaatc cagcactaga taattctaaa aacatgaaat acaataaaac attacattac 600 

tagcttatat ggttactaaa tatttttaaa ttatacaaat aaaaaataaa aataaaacaa 660 

aaaaatocta tagtgacaag aaataaaata aaataaaaaa attataattg accaatccct 720 

aaaacattaa tatttaaggg atattcatat gacaataaag ataatttatt tcatggaacc 780 

ttgattattt tatcttttaa aggtggtatt tttaaaattg tttaatggta cttaaaatat 840 

tgtatttata tagagaaaat cctccaaaaa aattctctca caagggaata gaattcctca 900 

agtttttctc ttgactaaat tgaccaacca ccaaacaacc cacgtcatcc atccatccaa 960 

cccccacaca acccaattgt ttctccattg tagacatcga caaatgaaaa tcatbcgatg 1020 

acgtatacac ttcatcctct ggtccctcca gggtgccatg agccacatcc cgaccgccta 1080 

tttcagatcc gacggcacag ggtgacagag cagcggtctc agaccacgcc atttggaact 1140 

cgccagccct gccccagcta acagtttcaa agctgcccgc cataacccgg tcctcccagg 1200 

gccgttagat cgtccatcct acgggagcac atataatact gccctagtgc cctaatccga 1260 

tgggaacggg gagtccttta tctctctcgg aaagcgactc attcgccagt gtgcgcatcg 1320 

cccgtgtccc aaggcaccgg gccagactct cgcatcggct ctacccacac tcacccccac 1380 

1 



tcaccctgtg ttttctctgc ccccttcgcg ctcttcgtgt gtgtgtgttt tttcacggtc 1440 

gattggcgag ttgcgaagga gggcaagggt gctgtggtgc agcatcagct ggtagtaagt 1500 

cagtcagggt tcgggtcgcg gtagttggac aag 1533 

<210> 2 
<211> 1539 
<212> DNA 

<213> Physcomitrella patens 
<400> 2 

atgtatttcg gagcgatttc gtgtgctgtt ggtgtctttt ggttggaagc gatttaaaca 60 

ggagagtctg tttggtggct tagggtaatt cggtggagcc tgaaagatat tgctacgtct 120 

tgaaatacca tcttgtttca gtgcgcattg cttgcaaaag cattgatagt tgtagcggga 180 

tatggtgctg tttatggttg tatttgagca tatgtttcgt gacatotgtg ttgcttgttg 240 

ggcttgccat actggtagtg tcttgttgag tatcatattt actttccaat gtaatattca 300 

acattttctc ctagcattac tataccattt ccatctattc ccaatggcgc tatcgtctcc 360 

otgggataca tttaacccat atttgtagtc cagtgcatta aatgcatgtg aaatcgcatt 420 

tatagatgcg catatttaat gtcaaattag acatcttcac tcatataata cattttacca 480 

aaaaatgaaa tgtacsacaca gaatattttc aaactgccga ctatctcaaa aacctataca 540 

ttatcaatct cattgacata cctcattgaa atactcctea ttgaaatact acataatttt 600 

cattgtcaat attgccaaca ttcaaccatg agaagctgat tattatttct tttatactgc 660 

ttactctctt aatgcaaatt caccattcct catgagagca gctgtatcta ctcccctgat 720 

caatattact actaacttct caggaatagt actcgatatg ttgcgctggt tcagttacgc 780 

aattataaag tccatcgtgt aaaccataat cgtcacaact ggatatctga tgccagaatt 840 

tcagcaaatt ttagtgccga tccgaccagt tcaatgcaga agaggaatat aactatctag 900 

aggttgttca caatcttttt cattacagtg cagccaaagt tctgcaacga agatacattc 960 

gcaacttgca tgcaaggtga agacacatat cgcggctaga tcctcagtto gttgttaata 1020 

cctgggaaag aaaatcaaca aatcgaattt ttctgcatca aatagccatg acaaagatta 1080 

gtacttccag tgcaagtata gtctgcggaa atatatcgca gtcctcgtac tacagcttca 1140 

aaatttgggt acatgacgag gatttcgacg cacaagaaca gaattaaccc gatcgtatcg 1200 

agcgttacag taaacgagac gaagtgcctg tgtcttcaaa ccggcagatc tctacgaagt 1260 

aagaatctac tcagcagtga gagcgagagc atctggtgtg gcagaatcta ctatgattac 1320 

aagtgcccta tactgaatgt agaagcctgt atccacctca tataggaaac gaagtaatcc 1380 

ataccgacat gttacatatc tccactgaag cagttccgta tgggcataca ggaaatgatg 1440 

aagcacaacg cgtataccaa ttttttatca gatacacaat cacccaattc aaaacgcacg 1500 

2 



1:i:t:atat3aac caacacgcca tactcaagat gagcaggca 1539 

<210> 3 
<211> 1197 
<212> DNA 

<213> Physcomitrella patens 
<400> 3 

tccttagtcg agaaggcgcg ggacgtgagt gagctctgaa gataagcttc caatttgcca 60 

ctgcaagtgt aacctgctcc atcgggcgcg agtccgtagg gatcatgaac acctcatttc 120 

acttggcgtt agtgcactct agcggcattg aagcaatcca tgccctcaga atgagtcgcg 180 

gggggcagtg aacgaactag ttaagaaatc cagtaatgac ggcaccacat cggcagatcc 240 

agatccattg cagattatcc tcttcagccg gaccgaataa accatgccta aataaccacc 300 

ggaatgtgtc ctgtgcggga ctgattgttt tccaaagaaa cactaactaa ttatatccag 360 

acagtgggat gtatgcgggt atccgtgaag ccagatatga gatctctgat aaacctgagg 420 

aagatgtctt acatggcggc acgggaaaca cgaagaaaag ccgaggagaa ggtattgaaa 480 

gctgagcata gccattggct ggtgaggaaa gggcatgcaa caactcatcg aaagcggagt 540 

aaactttgaa atcccgtagg cttcatgcga tgttctaaat tcttagcctc gacgacgatt 600 

tcaaggtctg attcgaagct tccgagcggg gctccggaac tgtcacttca gtcgactttg 660 

aaatgtgaag cgactttgct cacttgtgac acagcaattc aactccacaa tataaaaaaa 720 

tcgcgaaaca aaaaaaaaaa aaaaaaaatc tactttactc gtcgatgttc cactcgaaga 780 

caaacagctt taaagcgttt acctgtggta gagatagatt tcggcgaagg aattcaaatc 840 

cagcaaccct cccactcgta ccgcagacct tgagtttgaa cggttctgtt gctgtttgcg 900 

gtgagttcaa aactcgactg acctctctga aaccaaaagt ttaccttgag ctgcccgaga 960 

atctccgaac gttcgatata agatccaacg gtctcaagaa attctccctc gaggaacacc 1020 

tatgcccagg ggcagggggt tcctttatct ttctccctct gccgcaatcc atttcattgt 1080 

gcttgcagga ctgtcatccc tccccttgtt gccagtggta tccggaggtt cccgcgacac 1140 

cttctggtgc cggaactaag gtctgttgtt cctttcgtga ggtagagcac actgaag 1197 

<210> 4 
<211> 1012 
<212> DNA 

<213> Physcoiaitrella patens 
<400> 4 • 

atgcgacccg aaggatgagt acacgcgttt tggttttacg ttactgactt ttagctcctc 60 

cattcacact gcaggccctg gtttactgtt gaaagcacgg ttataccctc cgtaaactga 120 

acattctgtt tcagcgcgtc gtgtcttagt tgtcctttgg ttcacttttt agtttggaag 180 



caagtcgttg tatagatgat acttagcaca tatagttgct gtcgatttgt tttaagttca 240 

gcattccgct gcctgaattt cagtaaatac cttgtccaac ttcgatgcaa tataagttgg 300 

cttcagtatc cagtcttgcc ttactccttc attgcaatct tggtggcggt ctggtgcgcc 360 

tcgtccactt tcacgatgta cctcgtcagc ttgtttgaac acttcctttc tcctactgag 420 

tatggcgttg gcctcttttt ccaagctctg ttgatgtagg tcctaccttg tcaaaacatc 480 

acccacagag atttgacgac aatcgtaatt ttaatccgat tgtatggggt tcctgtcata' 540 

gtcaatatat taacgcccat cctctcactt accaacgtct gttaccaact ggacaataat 600 

gcattcacaa ccaaagtgca atttttgtat gagttggaaa tatcgaaaca gttagtgcca 660 

gtaattcacg caaatagttg tgtcatggaa actttttttt aactttctgt tgtccaatca 720 

tcgtgctgaa acatttagaa atgtggcaga cagttgcatt tgatgtatca actgctgtgg 780 

tagtaacact tgttgaaact gtaagataga catgccaact ttctggtgct atgtgctaat 840 

tgtttatatc ttcctgaaga atggtacaat tcaaatgaaa gtgggtggga gaattgatat 900 

cattgatagt ggaataggtt attgcaatca gtgagtcctt ttttcagggt agctaatatt 960 

ccttactgat tatccattga ccaccagtgt ggcttgtgga atgcgtgaag tt 1012 

<210> 5 
<211> 1386 
<212> DNA 

<213> Physcomitrella patens 
<400> 5 

ccgtgggact tagttgtctt cacttcatta ggaaatctgt ttgagcctct ttccattcca ' 60 

-atcttctcga caaaataggt tttttcagtg actcataact tattgtgctt tgcaaaattc 120 

ccactaatcc gaaatgtatg gtgtgatcac cgagctttta aattgattgt gtttgggcag 180 

tctacgaaaa atccagacgt ggagccttcg aggaacaggt tgttcgcgca ccgctacttc 240 

tgaacttcac aacgccgcgt ctatgtcgct ctaactcaga ggctataaca caagttagcg 300 

atgtccatcc ctctagtctt catatttgca acattaggag gaggcacacg ctggtcgaga 360 

. tgcccgtgga actcttccag attgctacca tcaatgcact cgtagacaga tccaaaagtc 420 

attccacatt attcaacatt aagggatccc caactgacca accaagagca ggtgctatga 480 

gtggaacttg ttattttcca aatgagcgtc gactacatat gcccaggcag aaggatatgc 540 

cgaggtatct gggggggcag gcatgtgttt tgtgtaaagt acccccgagg taagaacttt 600 

taagcggcgg cactggattc agaaacagtg gacagatata tccattgcca atgtattgat 660 

tggctggcga agaactgttg caaaccacga ccagccgtag gggcgtaaaa tttgaatcca 720 

ctgtttaaat ttcaaatttc aaacctcgac ggagtttcct ttagcttttc agatgggcgc 780 

agaacggtta ggaaactgtc ccgtcgcccg aatttgaatt taaaaaataa atcaaaacgc 840 



tagagcttcg attagtatgg 


gcttttttca 


ctcttctgtc 


caattctttt 


tgttttttac 


900 


ctcatgcaag 


gcggtcggct 


aaagtgactt 


acagggagga 


atattactga 


gagcaagagt 


960 


tttaccacgt 


tgtaggatct 


ggagaaatcc 


aacgatgcta 


ggcctacgca 


acgagtgtga 


1020 


ttcaacgcca 


gctataatct 


cattcgtgcc 


gtcgatcccg 


ccatccaacg 


gcgcagacgc 


1080 


tttgcgtggg 


aattgtacct 


tgcctacgat 


tggaatttga 


ctggcagctc 


ttgagctgga 


1140 


atttacttgt 


ctgcctgaga 


aagtitgaagc 


gtaagatgct 


cgatccaacg 


atgggcagaa 


1200 


agtgttcgtg ggcaggaacc 


aaagccctag 


ggcgggctcc 


tccttttatc 


tatctctctg 


1260 


gcatatctct 


tctcagtgtg 


cccccaggga 


cgtcttcttc tctccttttg 


ttcagcgtct 


1320 


cagtgctcga 


gggacggttt 


gccgtctttg 


tttcttcgtt 


ctcgttgtag 


atcatcctta 


1380 


gcgaag 












1386 


<210> 6 
<211> 997 
<212> DNA 

<213> Physcomitrella 


patens 










ttgtgacctc 


tcctctcgtt 


atcattacgt 


agcacgctac 


gaacaggaca 


ttctgtttca 


60 


gcgtctaggg 


tctttcattc 


agcatttaga 


accaaatcat 


tgtatagatt 


tcacccagca 


120 


taccaagtag 


ctattgattt 


gttgtgagtt 


cagcatgctg 


ctgtctgatc 


cgaagattat 


180 


ttgtaattga 


ctgttatatt 


tgagcatttc 


tgttcaatca 


tgtggtgtgg 


gtttgaattt 


240 


taattagcag 


gcactgagtt: 


ccgtgacccg 


aaaagaattt tctgagaata 


gccaggtgag 


300 


ttgcttcctc 


ttttgctgtc 


ggggatattt 


cttccgaaat 


atgggttatc 


cagcgctcta 


360 


tccgcttctg 


ctctgtgcta 


tgtgaacatg 


aatgcaattg 


atattcttcc 


aacatccata 


420 


taactaatgc 


atacttcata 


agaaagcaga 


ccgtcacgga 


taatgggaga 


aacattttcc 


480 


agtcatctcc 


gtgtccacat 


ttctctcaca 


cgctaaccat 


gttagtaaac 


cgcaaggact 


540 


gttattaagc 


aatgaatatg 


tctgaaaatc 


gtatgtgatc tgttgtcaaa 


gtgtcatagt 


600 


acccgtcatc 


gccgcattgt 


gcactgctgt 


cagatccgca 


gtaaataccc 


gctaacgaaa 


660 


ggaagagaaa 


gatgagagaa 


gatgagattg 


tcaccgggag 


agaatcagac 


gcagtcatca 


720 


gtgatactat 


tcgacggacc 


taacctcgtc 


cgtaaaatgc 


aagaatttaa 


cgaggcagta 


780 


aaatcagctt 


aaaacctccc 


cgcacgctta 


acgtaaccat 


ggctgtgcta 


a acatccacc 


840 


aagagaggaa 


acaccgcaca 


tgaacaactc 


ttctgaacta 


cacgtgaagc 


agagattgag 


900 


gcgaaaagaa 


agccacagat: 


cgctgctcct 


caagtggtga 


atttattttc 


ccttggaaca 


960 


aaaatggagg 


tgtggaggcc 


aagcagcaat 


ttcgatc 






997 



<210> 7 



5 



<211> 624 
<212> DNA 

<213> Physcomitrella patens 
<400> 7 

ctcgagtgca gtagacgaca aaatggaagg atgcgaccag ggatgaacgg gaagagtatc 60 

attaatgcga gacccttgga gttgaaggcc acgagtggga cagcgatgcc gagaaaaatt 120 

tgaaaatcgc tcatcccaga caaaatatct gtgggccagc cagggtttcc cagccagctg 180 

ctctgccgtg ccagccgtag atctgctcat ccgacggcca ctgcgcccca tcctggactt 240 

gtaccctccg gcatttggaa agtgtcagcc tctccctgac gaacatttca cctcggctgc 300 

ccgggaggcc aggagcgtca gatgggagat ctgacggcgg ggcggaggag agacctgaac 360 

cggcgggcag gggaacgatg tcgttgcttg ttcttctggc tgaggcgtcc atccccttta 420 

cctccgctgt gtgtttcaaa ggccgatatc tgcgcttccc ttgcggaccg agctctgtcc 480 

cgctcgctta cttctctccc accgagcttc cgaggttggg cattcccacc cttccttctc 540 

ttctcctctc cttctctgct cttcttctct gttgtctgcg gattaggtct tgtggtcttt 600 

cgagcttcgc acagcttgag caag g24 

<210> 8 
<211> 1146 
<212> DNA 

<213> Physcomitrella patens 
<400> 8 

gcgcgcggtt ggctggaaga agagtcgaga agcgatgtgc ggcagcggca gcagcaggag 60 

gggcaggcag tcaggtgcag cacgtcgctg gggtgatgcg gagggacttt gccggttggc 120 

tggggtacag aagcgagggg taaatatagt aagattacgc gcggcggaag gacgcgatgg 180 

ccaacgaggt ggaggggttg gggcggtttt acgtgtacag tatgagactg acactgacgt 240 

tgatcctgcg cgaaccaccg gggctagcgg tagtagatag ttggagcgag agttcgggag 300 

cgttgttgcg gataagctcc ggcgtttgac cccagggtgc aaccgtagtt gcatgggggt 360 

ggtgggggga ttgaaattgg aaccggactt ggagttgaga agttcgggtt gtttttggag 420 

gcagttgaaa gacgttttta agaagtttga gctgttggaa atacattgtt accctgagct 480 

taagcagtgt gtagtggcga tgtgtttaat tgtctgattc ctgtatgttg gtgtgtgcga 540 

ggcgtgtgag tgcgtggttg tgtgcttgac gtggcggtta tgggccgtgc tgtcggaatg 600 

atttactgga ttatttggtc cattggtttc gtggactgga gacggtggat gtttgtagtg 660 

ottgtgtgaa caaggcgggc atgcagatga tgggctcgca ataaagacag ggtcatgtcg 720 

ggtattgccc agatgaaagt ctcttttggt gatgccgata cggaaaatgg aagttggtac 780 

agtcgcacgt tcaggcgtca tgggttgcct tggaagtttg cattggaaga gagagttgag 840 

ggtgtcctgg atgatgtcca cgaggtggtg tttgaatcga tgttgtgcga agtagacctg 900 

6 



agcaccgatg 


tgtgacaacg 


gaatggtgag 


tttgtgtcaa 


tgaactgtga 


gcgttttgat 


960 


tgaggcagac 


attccaaggg 


gauggttititt 


m.mm4. 4«4*4. .mJ. ^ 

cggttttgtc 


ttttaaggct 


ggcgcctgcc 


1020 


tagcctcctt 


tgtccttcag 


cgcatgtttg 


cttgtgacgt 


ttgcgttggg 


attgttagta 


1080 


ttggtctgga tggaaatttt 


atcgtttcta 


tcggcagcaa 


ctaagtgcgt 


cttgtcattc 


1140 


ccatgg • 












1146 


<210> 9 
<211> 2973 
<212> DNA 

<2 1 3> Physcomlt r ella 


patens 










<400> 9 
ggatccattc 


aacggaggat 


aagtatgtag 


ggtgatactt 


aggctcattc 


attcattcaa 


60 


ggcgtattta 


attaactact 


aaagaaaaaa 


agggggttaa 


ttggggtgat 


tgggttatgg 


120 


aatgaataaa 


tgaataaatg 


ggtccccccc 


ctccccttcc 


tttcccttcc 


ctgcattaca 


180 


tatatatata 


tatatatggc 


atgcggtgct 


gagggtgtgc 


atgtgggggg 


gggggtgtgt 


240 


tgagagtgtc 


aacggtgcca 


gccacact'ct 


ccggacccct 


tcccattttc 


ctttcctttc 


300 


ctgccctgtt 


ccctgtccct 


gctcccaccc 


actttccatg 


cccttgaaca 


cttcctgata 


360 


aaggccctcc 


atccctccct 


ttcccttctc 


aacccattta 


attctatggc 


ttaaacatct 


420 


aaatcattac 


attcttatgt 


actaaaattt 


tatttataga 


ttgataattt 


tcttttaatg 


480 


aattaagttt 


gaattttatc 


tatgttttag 


ttccacaaga 


tttgttttat 


ttattacatg 


540 


aaacttcaaa 


agggatttga 


atatattaaa 


aatttccatt 


tataaatgaa 


tattcgagtg 


600 


agttitraatta 


aaattatttt 


tagcgtatat 


atatatatat 


atatagatat 


ggataaaata 


660 


caattgaatt 


aacctaggtt 


taatttttat 


aacaatgttg 


aagtgacctt 


catgtagtgt 


720 


gagtgcaagg 


atgtatttgg 


atat^gatgt 


acttcaaaaa 


aaacatgata 


aataattgca * 


780 


tagtattaaa 


gtttatgcaa 


taaagaagct 


agaaatgact 


aaaaattatc 


acaagcttat 


840 


taactcacaa 


acaaatcaat 


gatatttcat 


atcaagtgaa 


actgttaaca 


aaagaaagaa 


900 


ttacgtgtat 


atttcatgat 


catattcttt 


tgataattaa 


tggtagggta 


acactatgaa 


960 


cataaaatta 


ttgctctcta 


caatttatca 


aaagtataat 


aaaacaaaaa 


taaaacagaa 


1020 


atcataattt 


atgagtctct 


acagggattc 


actgtcaaat 


attgtaagta 


aagtgtgtac 


1080 


tattaattga 


ggggattgtg 


gtatgccatt 


ggaatacgtg 


gatcaaaagc 


tgaaacacaa 


1140 


gaattttgaa 


actcaaaatt 


acattaaaat 


gtttgaaaaa 


taaacacaaa 


atacaatttc 


1200 


ttcagaaaaa 


aaaaaaaaaa 


accatcgtca 


ataatgacag 


tcaacaaagt 


cagcatgcat 


1260 


gacgagctca 


ttgtatttcc 


tccaaaaaaa 


aaaaaaaaaa 


gaagaaaaag 


tgggccctca 


1320 


gttaaatcag 


agaatgccac 


atggtgatag 


gagaagagcc 


gatcataggt 


gatacgtggt 


1380 
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catgggatca tcgtttccat gcgcggaaat agatcgaacc cctctcagtg tctgacgggt 1440 

caacacgggt gatcgggtgg acccaccctg accagcccaa caaaacgcag ggaggaagag 1500 

gtggcaagta agtaagtccc acgtggattc gagacaaaac gttgtacgaa taatatacga 1560 

agtgagaaaa aaccacagag cgggtggcag tcacgaagtc gcagacacaa accgggctgc 1620 

ttgacacggc gacccgttcc ctgttctgcc gcccgttccg tcgccatctt tgtctcattc 1680 

gcacaaggtt ccttttccag tgccttctgc gcgggtccca ccctctccat ctgacccggc 1740 

ccgggctaac ccgttccgga gcagatgatg atcgacccgt ctcgcaggct ccttttgtgc 1800 

accgcgtggc ttcgtgattg ggccattgtt gctgtttgct gtttgttgct ctgctttctg 1860 

tgtccgggcg gcattcctga gaggcgattt gcatgcgcag gctcgttgta gagcagcagc 1920 

agcgctgagg gtctcgtcta ggcttagtct gcttctatcc ttcgctgctg tcgcctctgc 1980 

ttcatcgtcg ccgtctcttc tcaggttaga gcactttcaa gtgttggcca ggactgagta 2040 

taggaaggag ggtttattta tttatttatt tatttattta tttttctgtt atttttattg 2100 

ctggctgatg tccatcttcc gacgcgatcg tcgttttttt ttttttgttt gtttgtttca 2160 

ttgtgttgga ggagtgtaag atttaatcgg atgcataggt tgtgtgtttt gcatgcgttt 2220 

agagcgttta catgtgcgat gcacgagctc tggtgtcgtt tagaggccac tgatttagta 2280 

gtttcttgtg cgagggggat tagatcttgt accgcaagat gttgctccgg ggttgtggtg 2340 

gcgatggcgt tttataatta acatatagtt caatggtgat gatttaatta gcagtggtgc 2400 

atgagttagg tacggatcgg gcgattgtgg atccggactc gtgttcaaca ataggctgga 2460 

ttctcttcta ttgcgattgg ccagttctta catgcaatcg ggtacacgat cgctgaagta 2520 

gaacaaatta aactcatcga ctgaattttt gccgtctcct gaactgtcga aatagagctt 2580 

gaaaatttga ttgatagtga ttgtttagtt ctctgcgaaa tcgttctaca taatctttaa 2640 

attctgaatt aatctcaatg tattttgaca tcagctgatc gcttgtccgc tcgctcagtt 2700 

caattcgatt gagtattgcc tgcagatttt tcagaaaaat ttaagtaatt tgatagtaag 2760 

aacttgactt cctgtggatt ttaaacagta tagcatatga agtgccaggt tttctgaatc 2820 

ctccatttct tctaatcgct atttccgaag acttctatac agtatggagg gcgttctgta 2880 

ctgtcctgat tgcgagacat gttttacgac gaaaatttac tgctccttag aactaaaatc 2940 

ttctgaaatg gttgggcagg tcggtattaa gaa 2973 

<210> 10 
<2H> 1128 
<212> DNA 

<213> Physcomitrella patens 
<400> 10 

agcagtgcga cacatctttt gcttttttca gcacgtctct tagctcggct tattgaactt 60 

8 



cgattgctaa 


cgtttgtggc 


caccgaatta 


ggcctgctag 


cgtagatcaa ttagaggtcc 


120 


atgttgcaga 


aagcttttgt 


ttgtaaaaat 


agctgatatc 


tggacgcata cgactggctg 


180 


atataattca 


gtgccattca 


cattatttgt 


taacaggtcc 


agggttgttt gtagagtcgg 


240 


acagcatttc 


tcg-tcggaat 


gttggcgccg 


ttttgtgaaa 


tgaaaggtga ttatgggtaa 


300 


aatgcataca. 


tagtcctgtt 


gactatggct 


gagtggataa 


gatatatttc catcacaggt 


360 


tagatttcct 


gcggagtgtg 


aactgtgacg 


taaaatcaca 


gagtgcgtcg tcttagccct 


420 


agcccccgaa 


tcatccttta 


cgatggatgc 


atgttcggat 


gttataattt gatttttttt 


480 


ttttttcgtt 


gtttacggat 


ttttgaccag 


tttaccattt 


gttgtttcag ttgtgatggt 


540 


ttggttctgc 


gtagataagt 


ttgagti:gag 


tatatttcgt 


gagacgtcct acgccactgg 


600 


atatgtatcg 


ctgaagcaga 


atactgagta 


ttgtaattgt 


atgttccaga cgtttcagta 


660 


gttagtgaca 


gtggaatgaa 


gcaacttggt 


ttttctcttc 


tatggtcttg ccaatcgttt 


720 


ccgtcgcgag 


attgagcgta 


cctggtcaag 


ttgtgttatt 


ggtgagctca atgtgcttgt 


780 


gattggtcaa 


tttccatata 


taagtgaagc 


gccattttca 


aggagacaag gagctctatt 


840 


ctaggcattc 


accagtcctc 


ggctccaggg 


gcactcggga 


gatgaggtca agtctcattg 


900 


Cuagagccgg 


ccggiigacca 


ctictgagglig 


gcccattact: 


tgggatatat tccatggcga 


960 


ggtttggttt 


tgcatgctat 


cgacgaagcg 


gctagaactc 


tgggaatcta attattttgt 


1020 


ctaatccgtt 


gcaggacgab 


cagccgtgaa 


acagatacct 


atattttaag aatgtttatt 


1080 


cttgtgtgcc 


atgtgtttgt 


tat-tgaagaa 


taatcttcgg 


tgacggtg 


1128 


<210> 11 
<211> 3035 
<212> DNA 

<213> Physcomitrella 


patens 








<400> 11 
cgagatcggt 


ctgtaagccc 


tgtatttggc 


atggaatatc 


ttttaacaaa gaagatccat 


60 


cttttagttt 


ctcataatgt 


tgaacaacgt 


acttaaggat 


ttagaaagtg tgtttcgttg 


120 


cttctcttgt 


tagaatggcg 


ttatgagcct 


gtgctgtgtt 


cttcttttta gctggatgaa 


180 


ctgtacaatg 


tttcacaact 


gtagcctagt 


tgatcgtgca 


tatttgcgtc atgactcccg 


240 


gcaagttgat 


gtgttttttt 


cttgcttttg 


aatcccttca 


acctgtattt ggtggctcgg 


300 


acagtaactg 


ctacgatata 


cgtcagtctt 


tagtaagtaa 


tatgttcctt tttctctcgc 


360 


ctcacgtatg 


tcatatttcc 


tgagatagtt 


ttttaatttt 


cgctctgtgg tttcttgtag 


420 


tcctttcact 


gcgtgccgct 


atcacagctt 


ggtcatagag 


gaggccacat ttccagcgga 


480 


ccaacttgag 


gttacagcat 


ggactgagga 


cgggcttgtg 


atgggagtcc gtcacaaagt 


540 


ctacaagcac 


attcaaggag 


tgcaatttca 


tcctgagagc 


atccgaactc aaaacgggat 


600 
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gcagatcgtc 


ggaaactttc 


ttaagatttt 


agatagaaag 


gagacggctg 


acaagaagga 


660 


gttgaaacac 


aaattttgga 


gagtgtttga 


gtgatgagtg 


atactgggat 


ccttttttat 


720 


gggaaagatt 


gccagcagca 


gtaagcttgc 


ttttgttaga 


ttcctctccc 


tacagcgtgt 


780 


acctcctcga 


atatgcactc 


aagcaagcct 


agaggttgct 


gctatagatt 


tctcggtaag 


840 


acagggtatt 


attgaggcat 


tttttgcgct 


tccagatgga 


gctactacca 


caagtatcta 


900 


tcctattatt 


atctttaact 


tcgatggatt 


tgccatgatc 


actgaggtac 


gtcgaagttg 


960 


tgattggact 


tgtagtgatc 


acttccagag 


cgagctatca 


aactggtgcc 


tagaggagca 


1020 


acgcaaggag 


tgctgaatta 


ttctaatgat 


ctcatttagc 


ctaagttttc 


cgtcaaacat 


1080 


agtgatgttt 


ttaagiitcai: 


ctcgttagtg 


aaacatctca 


aagaaggtac 


accattaaat 


1140 


tattgcaggg 


gttgtgatga 


ctttatttaa 


tagttgacct 


cttcaattga 


gaacgcgttg 


1200 


ctctcctttt 


gtatagtttc 


aatcatatca 


aagctctatt 


tgttctctgt 


accttaagcc 


1260 


ttgtgtaagg 


catt:taaata 


atctcttcca 


cgat-taagat 


ggtagtitatg 


tcgccggttg 


1320 


caacttccaa 


gatgtcctaa 


tgctatagtt 


ctcattcaca 


actcaggagg 


tttgttgttt 


1380 


tatgtttttg 


aaagtgacga 


aggaaattgt 


ttacttttcg 


ctttgtgtct 


gtgtatttta 


1440 


gaatagtacc 


ttaacttctt 


acacaatggt 


gtctaatttg 


ttattcttgt 


gtatcacgag 


1500 


cgttaatcgg 


tttggacgtc 


ggaccctttt 


aaccaatctc 


aattgcttct 


gttctaatcc 


1560 


acgcgtccca 


cgaatggcag 


gtcaaatacc 


gattattgcc 


cgactotaat 


cgtgacagtc 


1620 


actgagacta 


ataacgggag 


gtcactatct 


tgtgacgttc 


tcgttatttt 


aaaatictgta 


1680 


taatggcaat 


ccctttctgc 


accacggcga 


actcatgatg 


attcttatcg 


agtcctgctc 


1740 


accaacttta 


-tcacaagacc 


ctiacggatict 


aactatigatg 


accaaaagct 


tgttctacgc 


1800 


atgcatgagt 


cccttcgttt: 


gggagatttt 


agaattctta 


ggaactcaca 


cgttgtccat 


1860 


aaattttaac 


caccgggcaa 


cataggatgt 


tgacatgtag 


tcacaaattt 


agaaaaaccg 


1920 


acttcaaaag 


gttgcccacg 


tagacaaaac 


aactcgaacg 


cagaaalicca 


ggcgaccggt 


1980 


gaaattggaa 


cattcacaac 


aaagcgagaa 


gaggtiicaaa 


aaaaccgcag 


agtaaaccct 


2040 


atgcgccaga 


ggggaatggg 


agatccacgg 


gattcggaga 


tgaaaaggca 


tcgcgcgagt 


2100 


aaaaacaaag 


agtgcgggga 


gcaagggcat 


ccagaagagt 


ttcactigaga 


tctacagtgt 


2160 


aactcagaaa 


gggagccact 


ggtacaaatg 


ccagctttgc 


aacgcagaac 


gaacgcggga 


2220 


gagctaacag 


atccgggctc 


aaaatctcct 


tcttctacct 


ctcaagccgt 


ccacaaccct 


2280 


cattctccat 


tctcgcacta 


ttctcctcaa 


accagttgca 


tctgcggttc 


cctccatctc 


2340 


caaccctacg 


gctttcgtgc 


gagcttattt 


gttgcctata 


ctaaggttaa 


acccactcac 


2400 


tttgttgcct 


atactttgct 


ttgctatttg 


gttgctttcg 


tcttcgcttt 


tgttctttgg 


2460 
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tttatctcaa 


gtgcacatgt 


tctcgcgacg 


ctgtgccgct gtaggggctg gtgggcttat 


2520 


agacctgagc 


accgaggcgt 


gggtttgctt 


cgactggctg tggttgttag caaggtgttc 


2580 


tcgtaaggta gttgtgttca 


gagctagatc 


ttgtgacggt gatgcgaaaa atgcgttcat 


2640 


cagagttaag tgatagaggg 


gcttttcgtg 


agatctgctt ctgtgatgga tctgctgtga 


2700 


aagcggtccg 


cgttctcctt 


tatcttcagc 


tctgtgtctg atgtttggga aatgcatcct 


2760 


ttggatacgg 


tgcgattcag 


gctgtatatt 


gaatccccga gttttggaaa tctttatgac 


2820 


ctcacti'taa't 


ccgaaagcta 


atgggctgta 


ttgagtgagg ctaatacaca tctctccata 


2880 


ccgcgcttcg 


gtttcgactc 


gtctlraccga 


ccacattgat tcacatgcgg agacatcagt 


2940 


gttggatcac 


ttacagtctg 


acctaaatag 


cacgtgtgct acacatagtt tcaatgccag 


3000 


taacagtctt 


ttgatgtgca 


gagtatttct 


tctcc 


3035 


<210> 12 
<211> 1221 
<212> DNA 

<213> Physcomitrella 


patens 






gctagtgcat 


acctgtctcc 


tgaaatgcta 


tcacaccttg tcaggtgggg ttatggagtt 


60 


tatttgtagt 


agctaagcag 


ctcgaagagg 


ccagtgagag actgattttt caggggttgc 


120 


aagggaatgg 


ttactcgagt 


aaagagccag 


cgctgtcgag accttcttgg tgcaattcca 


180 


tcttitgaaag 


t:at:gcatcac 


aagttagatt 


cgtggctttt gagcttgtcc tcattatttt 


240 


gcctaccatt 


tatgtttttg 


tggatttagc 


atccgcggcg tttaagtttt tgttttaaca 


300 


ttctttcttg 


taggttcgga 


tagaatgttg 


gggacatttt atgcttgaag agcgtcttgc 


360 


actgtcggac 


tgtaatgcaa 


tgcttgtgga 


cctcagcctg gcctgcaata cttgtatatt 


420 


cgtigaaaaca 


atcatagcga 


ctctgtgtta 


ttcttcccat gtcattcact ggctctcgaa 


480 


ctttgtcgaa 


tacatctgat 


gggcacgcgt 


gcagaagccg ttctttaacc tcgatgggat 


540 


ggattagtac 


gatttgctgt 


catttaaaac 


tatttgctat ccgtatttgt cttctgttcg 


600 


gaaatttgtg 


tagcttgtta 


ttttatggta 


tgttgtagga aatcagcttt ggtgagaaat 


660 


ttgtttcata 


acgacacaat 


ggaatgatga 


attaaattgt tgccagacca atatcgtatg 


720 


tgtcaatctg 


attcctcaat 


gcagatatgg 


ttgtggagcg tctgctgtac ctccttgttt 


780 


-taaccgccgt 


atctgaacca 


actcgaacgt 


agtttgaaaa atgcactaaa tgatgcatat 


840 


tcaatcggtc 


aagtcatatt 


aaacacgcgg 


ttttgaaagg tagcaggtgt atataatata 




aacatgtata 


tcgcaaaggc 


ccattcctga 


cattggatgg tgctaattaa gatctaatga 


960 


accgttcctg 


gcaatgtatc 


tatcaagcaa 


actgaagaca caatgaatcg ttgagtgtat 


1020 


gtagaaacac 


aaaacgatct 


tgtatttcct 


tttcatgtgc cagagtgagc ctcatcgatg 


1080 
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tacactgata 


ggactcaact 


ttgatatttt 


ttgaagattc ttatgcctga ataaggtact 


1140 


tggaatcata 


gttctttgtc 


tcatggctta 


acttgattaa 


gatttgggga tttggaacct 


1200 


ttgtaaggag 


gcaatgaatt 


c 






1221 


<210> 13 
<211> 3060 
<212> DNA 

<2li3> Physcomitrella 


patens 








<220> 

<221> misc feature 
<222> (1)..(2301) 
<223> a, t, g or c 








• 


<400> 13 
agactctact 


aattgacaag 


tatgtgacta 


caaaaggcca 


caagactctc tctgcactat 


60 


aactataagg 


ctcatatttt 


ttgtccatgt 


agcttgtata 


tatatatata tatatatata 


120 


tgtatattta 


aatcaaaata 


tttttattca 


aaaacaaaat 


acaataaaaa accaaaaaat 


180 


attttaaaaa 


taaataaaaa 


attattaata 


cttttatgaa 


gctattattc aaatttattt 


240 


ttaatttcta 


atttaagatt 


tattattttt 


tcttaaattt 


attaaacttt ggaatttatt 


300 


tttaaaataa 


ataacaataa 


aataatttat 


agtgttttta 


ttgataagta aaattaagag 


360 


ctaaatttgg 


atcattatta 


caaagttata 


atacttaaat 


atttattgag atatatttaa 


420 


atttaattaa 


tattttttat 


taagttatat 


atatatatat 


atacacatat tatgaaatta 


480 


tttaaaagaa 


gttagtagac 


ttttaaatat 


tttttaccat 


gttttaattc tagtacaatg 


540 


tatttaaatt 


atcttattaa 


gttatggaaa 


agaagttagt 


aggttattaa atgttttgtt 


600 


agattggttg 


taaaggtttt 


atgataatct 


tgtatgataa ggttgtttag catagtttat 


660 


tttgcttaat 


taaaaaaaat 


tacatcttgt 


tacatttaaa 


tttaaaaaat acatactata 


720 


cacatatctg 


tatttagatt 


gcttttacaa 


tttttatctt 


tttgtttttt gcatatttca 


780 


aagaaagccc 


agcatgtgta 


taataatttg 


tataaccctt 


agaaattaat aatatttaag 


840 


taaataatnc 


ttatttataa 


ataaattact 


gtttggtttt taatncaaga atttaaaaga 


900 


cccaattgtt 


tattccaaag 


taatagtagc 


ncattaataa 


aaatccttca aaaatgaaac 


960 


taaacaaacc 


aatgcatctc 


aaatgaaaag 


gagaagaatg atcttacata gacanccaca 


1020 


aggagggaca 


tgacaactta 


attagactat 


gggtttagga 


acatcaacca ttccctacta 


1080 


ccaaaaaagc 


ttacatgatt 


ttaaataaca 


caatattcct 


tgtgactttt gtgcattatt 


1140 


gaggatatcc 


atctatctag 


attttggaca 


atgttttact gcccaaattt caataagaac 


1200 


cattcacata 


ttttgaaaca 


catttgatac 


actctacatt 


oatgtctaga gtatagggac 


1260 


ttgggtttaa 


gattagggtt 


tcagattagg 


gcttgcaggg ttacagttaa aagttaggat 


1320 
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taaagattta gatggagtct 


tggttcagag 


agaaaaaagg 


atttggggta 


aagtttttat 


1380 


gaaagagaat* 


catcgcccaa 


acaagtagcg 


ggactgctga 


atgccttttg 


caatgaatga 


1440 


aaatttatca 


acgtccgtca 


atatgtacaa 


gaccatcaca 


taatggcccc 


cctgaccaca 


1500 


atttgaaaaa 


cacacacttc 


ctgcctggaa 


ccagtaatac 


aagtcattgt 


aggggagaga 


1560 


gagagaggga 


gagagagctg 


tagctgcgta 


taataagggc 


ctcgcagatt 


cagtgctacg 


1620 


tcgtatggat 


acaccgtatc 


acttctggtg 


tacaggttac 


tiaaatactac 


tcgacacggg 


1680 


gcgggccgat 


ctgcggaacg 


cgccggggcc 


atgtcccagg 


gccctaggcc 


cgccatattt 


1740 


ctctcgtcca 


cccgggccta 


cgcaaacttt 


cccttctcac 


tttcccagct 


cacgctctct 


1800 


gttcaacgca 


caacaacgcg 


tagccgagac 


gggttcggag 


cacaaagtca 


cccagcccgg 


1860 


cccgacccgt 


gcccgtctgg 


cgcctatctc 


tctccgcctc 


tgggcccgtt 


tcgctcctgt 


1920 


ccttgtgtgc 


tctgtctggc 


ccttcaccgc 


gcttcattgc 


ttcttcgacc 


gagagcctict: 


1980 


tagctccgtc 


ttgttcacca 


ctgccgcggc 


actccgaccc 


cttgcatact 


ctcttctgcg 


2040 


gtgcctgctt 


ctccccatct 


cctgcatcgg 


tgccctgttg 


tgtttttttt 


t:aaaggt:cag 


2100 


tccctctatc 


acgtcagtgt 


ttcgcatttc 


cg-tgaagtgc 


tcagggtttt 


ttttgctgcg 


2160 


aactgtcggt 


ggagatgrtgc 


tttttgtcgt 


gtttgatgtg 


tgtgcggtgc 


agcoataata 


2220 


ggtttcttgg 


aggaggaggg 


agagtcttat 


tttagtcttg 


ttgcccgqtq 


tcrctcoaaac 
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2280 


gcgaatgtgg 


gtttatggta 


ncgcacaggt 


ctgcgtttgc 


gatatgtcrtg 


tiagaaccctg 


2340 


tgccgagcga 


tcatcataat 


agtagtttct 


cgtttcggag 


gggctgggct 


t: Q't ca a cT'k cro 


2400 


aacgcagagt 


cgtagttttg 


agagttccag 


acgcgcatcg 


cgcagckgtia 


criicraGfa'bcr'ta 


2460 


gcttctcggt 


gtgtttagtc 


aaggtttcgc 


ttttccgatc 


ticggatcatg 


"k 1: ti acg't cc g 


2520 


tcctttaagc 


tggatctctt 


gttctttaca 


gaacttgttc 


atcgccctiga 


ctaagttgct 


2580 


ccagtgttgg tctgaagacg 


acaagcctct 


ttctttcttg 


aatagtaaga 


agaggaattt 


2640 


aatictgaagg 


cttgttttgt 


acagtagttg 


gtcgtttatt 


ctttgatgtt 


taacttagcg 


2700 


tttcgttgta 


cttctactaa 


tgtactcttt 


agcttggtcc 


gaggctatta 


tttaatgagt 


2760 


catgccctga 


agtcgggaac 


agcgggttgc 


acctacaatc 


atatggatat 


gagga-ttcgg 


2820 


gtcgagtatt 


aacttgtagt 


cctttgttca 


ttgtttttga 






o Qfin 
^ooU 


aactgcctga 


atagcacgca 


ctgctttccc 


tgcgttcgaa 


tcgtcatcaa 


cattactatt 


2940 


gtgtaatcca catggctaca 


gctgctgtaa 


ggttctgcgt 


caagggcgtt 


cttcaagaaa 


3000 


taacctatgt , 


cttccttgaa 


attaaatatt 


ggtggttgtt 


gtgcaggtcc 


gtattaaata 


3060 
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<400> 14 



attgtccatg 


tgcactacta 


aacatttttc 


agcacactcc 


cttccccggg 


attgagctct 


60 


tgctgtgtag 


aactctcgtt 


gcaagtatca 


gtgattgcag 


actttgactg 


gtgagcacag 


120 


attcaacaga 


ggtttatttc 


gcagatgact 


atggtttgta 


aaaatagcag 


atatctgggc 


180 


tcaattctaa 


cggctggtat 


atgtcagtac 


ctataaactt 


aactgtttgt 


agctctagat 


240 


cggtgtggta 


aagtccggta 


ccaattcttg 


tcccttttcg 


tattaaataa 


agggtatttt 


300 


atttcatata 


tcgtcttttc 


cttttgtcat 


cacatctcta 


tcctgtgcat 


atcatggttg 


360 


tattctcagt 


cgtaatggtc 


tttcaagtgg 


aatgatggct 


ttgatgatgt 


gcacctggtt 


420 


gtgtctctgg 


gcgtcatggg 


cttcacatiga 


gctgcggtta 


cagatcacgt 


ccagcctcac 


480 


acaattaact 


aggcatgctt: 


tccatttcct 


tctgacgtaa 


atgacaggct 


ctgacaacaa 


540 


tgcctggcac 


ttcctgacgt 


gggacccgtc 


gattggtgcc 


gaagtcgagc 


aaaattctaa 


600 


cctccacaac 


tggtatcgtg 


aatattctag 


cctcttcctg 


agaacagtgc 


cggtcgatct 


660 


cgaattacct 


cgtaatagtc 


gtcaggcatg 


tatgtatgtt 


taaaaatact 


ccatgcggct 


720 


aaattatttt 


ttaaaattta 


tctttggatt 


tgaaatgaat 


ttctaccttt 


ttttacttta 


780 


agttacgagc 


tgcgattcca 


actaatgaag 


ttttacatac 


-taatcagaag 


aatgtcgttt 


840 


tttgaaatta 


acaggbtaag 


t:gt:tt:t:gaag 


aattaaagta 


tgatgattcg 


tcttttttat 


900 


atcaaatgag 


ttttgaatga 


ttcgtcgttg 


cattttttaa 


atcttggaat 


gaattgcgtg 


960 


tatgtgacgt 


gtatggaaag 


a1:acaaat:ct: 


catgtagtcg 


agtacaagac 


aattacacct 


1020 


cttatgttta 


tggttcattt 


gtacatagtc 


tacgttagct 


taaggtcatc 


gtgtgtgagt 


1080 


atagtatatc 


tcattaccta 


atttgaagtc 


cagtaaatg-t 


tagttatgtt 


accatcgacc 


1140 


agttatcacc 


gatgttgctg 


agaagcaatg 


tgaa-tcttag 


gaaacgagtg 


atiatttgaac 


1200 


tggatattiaa 


ttcatccgta 


atctataaac 


agacatgctc 


tactagcgtt 


aaaacataag 


1260 


ctiacagcaca 


aaatgatcta 


aaaaaatgtc 


atcaatcata 


agctgtgtat 


aatacatccc 


1320 


atigaatiatca 


acagtatgag 


tttgggtgtt 


tgtgcacacg 


taaaaacgaa 


ccctcgaatc 


1380 


gaaatgtgta 


ttactgaatt 


cacatgcaaa 


tgaattgttt 


ggatcattta 


ctgattaggt 


1440 


ctgtactcta 


ttaatgaaac 


atataataga 


tttaagactg 


tccagtcagt 


tttgaattaa 


1500 


gccttgggat 


ttgtggtctc 


ttcctcttcg 


gccactaaaa 


gtttaattca 


cattgatgtg 


1560 


aaagaaaaag 


tcacaactca 


gccttcgctg 


tgttagaaaa 


gctgcacgtg 


tgaggacttc 


1620 


tcaggcagcc 


tttccttttt 


cagttgagtg 


tcgaagtagg 


agcacacglzc 


gtcggtaacc 


1680 


ggctacagga 


ggtgtgcact 


gtccctttac 


cggatgtggg 


aagtcaccct 


atcctgagta 


1740 


tggctcacac 


ccaacgttgc 


tactccatcg 


cacagacagt 


tccacatgat 


agactgctcc 


1800 


gcgagaagcg 


tcactctcgt 


gcggtctcac 


ggcttctgtt 


gcggccgatt 


cagtgcaggg 


1860 
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agtcgttttc 


gagcttgcga 


agtggctctc 


ttgtcattcc 


cctgcttctt 


ccggcggcca 


1920 


ttttgatgca 


gaattgcgaa 


ttctgcagaa 


tatgttgaga 


actcgtcttg 


ggggttttcg 


1980 


gatgaggagc 


taaaacccta 


gagggacgga 


caattctgtg 


gagcttgctt 


gtaatcctgc 


2040 


agtacaatag 


aataatagag 


cgacatgtcg 


acgctttcga 


ctcatgctcg 


cgtgtcgtca 


2100 


ctgtcatcag 


tgtcgacagc 


gtcgaatgtg 


gtggcaaatg 


tggctgtgag 


gccgtgtatg 


2160 


atagtatctc 


ttctgccggt 


tgcgagaggg 


ttgtgctcta 


ggaaggggtt 


gatgtcgcgt 


2220 


ggacctctcc 


gaagacattc 


ttgtatgaag 


agtgtttcag 


taatgccgag 


agcttctctc 


2280 


ggtcaactgc 


ctgaccctga 


acaggtggac 


ttgtacatta 


atgcgttgtc 


ccagacgccg 


2340 


gacgccctgc 


agggattgct 


ttcgcggacc 


gaggggctct 


ttttcacatt 


ggcggatgtt 


2400 


gctgtggcga 


ctgatcccag 


ccaggtcacc 


gacgctgtag 


tgcagaaaca 


ggacggaggc 


2460 


tggcttggag 


gtgtctcgaa 


ttctcttcfag 


atagctctta 


ccgtaagctc 


tttttatttt 


2520 


tatttttata 


tatttttgtt 


tcttttttga 


actgtgaatt 


gtgtatattg 


ttttcctctg 


2580 


aaattttctt 


tcagaatcta 


ggtggtaaaa 


cattctgata 


cttatgctta 


ttgcacggtt 


2640 


tatctaattt 


actaagattt 


agtgtgaatg 


tgatgatata 


attttactaa 


aatttaagat 


2700 


ttttctaaaa 


tttaattgca 


gctagtgtta 


tctttcgagt 


cgatgctaaa 


acattcctgt 


2760 


tgacacgatg 


atcatgaaag 


t-bagatgtgg 


cttaatiaaca 


aatgcaggaa 


ttaatgaatt 


2820 


ttatttattt 


atttattttt 


gcagtttttg 


aaggatacca 


ttgctaagct 


aggcatacct 


2880 


tattcgtatg 


gtttcgcaat 


tattctttta 


actattttag 


tgaaggcagc 


tacttatcct 


2940 


cttacaaaaa 


agcaggtttg 


ttgttctact 


gattttctta 


ttttgtgctt 


tctttctttc 


3000 


actttttgcg 


tacaaatcat 


ttttgtgata 


tactaattta 


ttgtgt:aaaa 


ctaaaagaat 


3060 


tactatattt 


ttcagctaaa 


tatctgtcga 


tgtcctgtat 


ttactcataa 


gttttatggg 


3120 


tttaagatag 


tacccagaca 


ggactgagtt 


ccattggtag 


gtcagtactc 


ctgttagatt 


3180 


agggaggctt 


ctattgttgt 


atatctaatt 


gaaagtggtt 


atgtttaaca 


ggtagaatca 


3240 


actttagcta 


tgcaaaactt 


acaacctaag 


ataaaagcta 


ttcagactcg 


ctatcagggg 


3300 


gatcaagagc 


gcattcaatt 


agagactgca 


agattgtata 


agcaggctgg 


agtcaaccct 


3360 


ctcgcaggtg 


caattttgtc 


gaagtcctcg 


aagcattaat 


gttaagaatg 


cttgcagatc 


3420 


actttccggt 


ttttgacgga 


cacaaaatac 


agtcgaaggg 


actaatactc 


aataacttgg 


3480 


ttctgtatgg 


tagctcataa 


gggttgtggt 


ttatgatttt 


acagggtgtc 


tgcctactct 


3540 


cgcaacccta 


ccagtatgga 


ttggattgta 


tcgtgctcta 


tcaaatgttg 


ctaatgaggt 


3600 


attgcatcat 


gaactggagt 


gcttgaaaca 


gttgtccttg 


tgcggcatgt 


tgttccacct 


3660 


tagtttattg 


tgaaacatag 


gcgtcattag 


acaatccaca 


tttagagtaa 


tacaggaagg 


3720 
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tcttaccata 


tattcatttc 


aaagaggttc 


aacagacatc 


gtaatgcaaa 


gttctgtaca 


. 3780 


ttttctcttg* acttcaacgg 


gagaatatct 


attcttaaat 


gagatatttt 


ctgtggtact 


3840 


ggtattcaag tatgaat'gta 


tgtaactatg 


atttacttat 


gcagttctgg 


ctttgcaggg 


3900 


gctcttgact 


gaggggttct 


'tc'tggati'tcc 


atcc't'tggca 


ocrccctacaa 


ccratiticfchctc 




tcgttccagt 


gggagtggca 


tttcgtggct 


atttcccttt 


gtggttagtt 


agtcccttca 


4020 


gatgcttgtc 


ttcgttattt 


tttttccata 


tcaaatgtaa 


tgatgctggt 


catacagtaa 


4080 


catatagtga 


atttgttgat 


caaaatggtt 


gtccatggaa 


gctt 
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<400> 15 
ttgttgaatc 


atgttaattg 


ccaatiggtta 


ttaatgacca 


tcatattgta 


cctggaatgc 


60 


attggaaaag 


taatgttcca 


ctaaataaaa 


gttgatccac 


caaatattgt: 


tgtctagtca 


120 


tatcgacaaa 


tagattcaaa 


ataaattraaa 


attaaaatitg 


aaaatgtata 


aacattggca 


180 


tgaaaatzgat: 


attaatttaa 


aacaattcaa 


aacttataca 


attattitaaa 


atacattagt 


240 


caccgggt t a 


aaggagacag 


actgacagaa 


ttggattgcg 


gcaatcagta 


gcactgcaca 


300 


aat:aaat:t:iia 


acat:gaaaac 


attatgattg 


ctaatactct 


gtttgcatgc 


acttctacaa 


360 


caacaaaaac 


aaaaaatiaca 


aticaaacaaa 


acaagcaaac 


aataaatgat 


tttagatttt 


420 


gcatgataca 


agcaccagag 


ataattratga 


ccatgtgata 


aatacaattit 


ggaccattta 


480 


tatcctacaa 


aaaaaagaaa 


aaagaaaaaa 


gaaaagtttt 


tgtttgtatt 


tgatatcttt 


540 


attttgttac 


caaaattaga 


taa-tligcaag 


ccttgtattg 


tctgagatgg 


aatigtiatatg 


600 


taacacattt 


gagcaaaaaa 


ttaaattaaa 


ttaaattaaa 


taagattttt 


ttatatatag 


660 


taaattgtaa 


aattgaccca 


aacatttact 


aaatcaaccc 


acccattcta 


accatcataa 


720 


gaagaattcc 


gctatcaaati 


ccaggttggt 


tgaaaaccaa 


tgaaaaaatg gttggcttct 


780 


caaccaatga 


taa-tggatgg 


gtitaatittaa 


taaattcatg 


ggtcaattta 


aaaatitccat: 


840 


atatataatt 


aaaaatcaat 


tgcaaaaaat 


attttgacac 


aatcacacgt 


gttttgaaaa 


900 


tcatacatgg 


acaaaaatac 


aaagagattt 


tttaaccaat 


attttggaaa 


cacatttagc 


960 


aaggtgtcca 


atgcccttcg 


atracccacaa 


gaacacacct 


tactttgccc 


atatttaccg 


1020 


atatatgctg 


cagtcagtta 


gggttgaatc 


cctgagggag 




gtgtgaacaa 


1080 


agtccaatgt 


ggggccgcGc 


aggattaggg 


caccaggtgt 


gaacgaggct 


ccacccgagc 


1140 


gagagccagg 


aat-ttgaaac 


tggcatggga 


aagggggttg 


gttccacctg 


atggcacctg 


1200 


cccaccacca 


ctagtaaaga 


ttcaatgccc 


accacactgg 


tttttgaata 


taggatcttc 


1260 
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cttctccttc. taattcttct cttgatggat gaataatata accgatgaat gagtgggcac 1320 

atggacgggc ctcgccccct ctctactctc tgcaatacat tacaaaatac atacatgtat 1380 

acatagggat ttgatgactt caatacatac acactacaaa accgggtcag gaggggggta 1440 

taaccaggca agcccgagtg gcgggcagta acaaatacac acccccaaat cgtatgggcc 1500 

ggacacgtct gagcgacacg cgggtgccct gccctcctgc cccttccctc gccccttttc 1560 

tctcgaccgc ctgtcgccgg cccggcccag actcctgcca acctgggaac caacccccct 1620 

ttttggtgag tgctcttcac ttccctcgca ctcgctgctc aagttgaggg agggagggag 1680 

taggagtagt cactcacccg gcctggcccg gtccggttcc ggtccgcggg ggctgcgttg 1740 

cgcgacccgt tctcgtgggg ttatctctgg ttctctatcg ctcgctcttg tgcatcgtac 1800 

tgctcctact ttttcccatt gttgctatgc tcgctgccct gcgctgcttg gccgtccgtt 1860 

gtgcccctcg ctcgtcaacc aagcactgca gttcgctccc gcattccttt ctgcagcacg 1920 

gtgtatctct ctctctctct ctctctctcc tcatctgttt agcgctggtg ccggttctct 1980 

taaggtgaga gcttctgttc tatcggtgtt ctcggttttg gtatgtgtgg tgaccgacga 2040 

tcggtttgtt gtgcacggtc gctggatgta tggtcgtctt tgttcttgtt tagttctgtg 2100 

tggcgattaa cgtgttcttg gaggagtatt tttggccttt gtctgctgat gcgctcagca 2160 

gcgttgcgtt agtgtaggct tgtgcttcac atgagcgtgc cgcgcgtcta ggcgtggtgt 2220 

ttgagttgaa tcttttgccg aatgactata gttattgatt tcttgttatc tgaagatctt 2280 

gtgctgagat atgtggtgta gggattcgag aagtgctatc cccttgttgt gatgaacagi: 2340 

jbttcatttga tgtggttatc atactttgga gccttgcatt ccggatcgtc attagcttca 2400 

tctacgtggc tggatttttc cgtcaaccgt aggctgaagt gccttaaggg gttacatgtg 2460 

ctgagttgac tacatgtaac aatggcatgc aaactgattg cgtgcacttc atacttgtat 2520 

tcagttcgtt gtagagtccg ggatatatgt taggtagaat aaagaatctt atctctcggc 2580 

attcgaataa aaatttcatc ctttttgaat gcaccttgtt tgaaaggtcg ccccatgccc 264 0 

acggttgact gagaacaatg tctgcgcatc agttactgat ggtcgcacct gttgtcacta 2700 

atttgagtga ttaaggtttc ctaccggctt tttcttttcc actgatttag tttattcttc 2760 

atcaagttta caaatattgc tctgtatatc acggtttttg ttagtctttg atgtaatcat 2820 

attacctggg tttattatct agtgaactat gactgatatg ctggcgcata ttctcctact 2880 

taatttgacc ttattagaag atgttcgtac ttagagtacc ttttacttaa tgtaactgaa 2940 

tctatcattg ctttcgttct taatcgtgct acaaaattta actcattctc tcgttaacta 3000 

atgtttttga gcacttgcac tgtttttgaa ctcctgtagg atcattctaa aaa 3053 



<21Q> 16 
<211> 1879 
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<212> DMA 

<213> Physcomitrella patens 

<400> 16 



atctgtactg 


cacagtttta 


catttttcag 


gcttgcattt tgctgggatt 


gagttcttgt 


60 


tttgatagaa 


ctctggacgc 


aaatgtcttt 


gactgcttag ttgggctggc 


gagcacacag 


120 


taagaagtgg 


tacatgttgc 


cgaaactatg 


gatttgtaaa aatgaaacgt 


atctgggcgc 


180 


ataacgaact 


gcttatatat 


gtcgctgtct 


gttaacttca atctctacat 


gtccagatcg 


240 


atgcggtaga 


acccgaccat 


tttttgatcg 


atgtttgaac ctttttatgt 


taaataaaag 


300 


gtaccatgtt 


ttcagcgcat 


taatcatatt 


tattttggtc actatggact 


tgatgtacac 


360 


cggatgttac 


agctcagttc 


tacttcacag 


ttattcactg acttgccctg 


aaaaagtcgg 


420 


agt:gcagatc 


tcgttgtgtt 


ttggtaatct 


ggttggccag tctcagagct 


ctattttttg 


480 


atgaatccag 


ttgattggca 


ctcaatgttt 


ttttttattt tttactttta 


tcatagtgtc 


540 


aaggttgcta 


cgccaggaat 


gctgtgaggc 


acattctacc cgtatgaatt 


tcctcgttcg 


600 


caatagctgc 


aagctcaatt 


taggtttttc 


tgagcaagtt gtagaactat 


cgtgtactct 


660 


caccagattt 


cagcctctca 


gtgctgagtg 


ctttcgtcac gttiaactaat 


tgtggaagat 


720 


ttggaatcat 


ggttgcatcc 


cttagtttga 


cagaattcac agtcgttagt 


tgacctctct 


780 


atcttggtcc 


accatatgtc 


aacctgttca 


agagggctgt gctcggttag 


gtaatcactc 


840 


agaagtttct 


tcctacagaa 


aacttgtttt 


gtgggcatca tctacgtgga 


agaattgttt 


900 


gagcattaaa 


tcattcaaca 


ccttcagtta 


catgaagtag gttggaagca 


gtgccttgaa 


960 


gagatccttc 


acagaaagcc 


tctcaattct 


catgaagtct gcatctaact 


tcttttgaag 


1020 


tttgtacacg 


tgtgggcaga 


attgaagttg 


gttttgtgtt gtttgaaaca 


actgtaattt 


1080 


aataaatccc 


aaacaagact 


aaggccatct: 


aacgttttca catgttttaa 


aaaat-tacat: 


1140 


tgaactttgg 


gctaccgtag 


ttttagacag 


atgcaa-btaa aaataaaaag 


aaaaaaatga 


1200 


aaagaaaaaa 


gtcttgtttg 


ttttagttgt 


ctgttttgta cagttttgtg 


acctatttta 


1260 


gagtgtcatg 


tatcgaacat 


ttgactcaca 


attataaggt tttatatttt 


aaatgagtct 


1320 


tgttgtcttt 


tattttattt 


tgttctacat 


tctgtaatat taaaacttct 


attgaaaaca 


1380 


caacaaacat 


ttaatttcaa 


gtttttcaaa 


tttatatatg catattttgt 


atgtaaattg 


1440 


tacaaatgtt 


cataatgcaa 


attgaaatat 


ttaatgtaag attatagcac 


ttaaacctga 


1500 


tccaaaagat 


aataattttg 


ggcaaataat 


taaaattatg atagacaaag 


tttagaatgt 


1560 


tgtaataaaa 


atttatggta 


agtgctaaag 


tatgtaaaac aaatttcata 


aagaattgct 


1620 


tgtagcattt 


tcaagagaaa 


aaaataaata 


cttacgacta tttttaaaat 


gacacaaata 


1680 


gtaaataaca 


atatattgat 


gaggatatat 


atatataatc aaaattaacc 


attagtgatt 


1740 


tttaacctgc 


atagtattaa 


tgtatgggac 


cgcaaggtag acacctacct 


ctactggata 


1800 
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gcacctctca tatacacaat aaaactttta ccttgctaaa agtccaaggg aatttacaaa 
agaaattctt ttaaaaact 



1860 
1879 



<210> 17 
<211> 1823 
<212> DNA 

<213> Funaria hygrometrica 
<400> 17 

ctttcgtgtt gcctcaagag tgcctcgcga agaaagaagg ttccagcaac aactagagaa 60 

tgggtacagc attcataaaa ctacagataa ttatccttca aataagtaag aaaaaagaag 120 

gaaggaattg ataaataagc aagaaattaa gcaaagcagc cactcggcta gacaaaagag 180 

actgcacacg ggtggccaag gaaagcgccg gtcatagggg atatgcggtc atggggtcac 240 

tgtttccggc agccggaatc gattgcaccc tcgcagtggc tgacgagtca gaaccgggtg 300 

ccaagtggac ccagctcagt cgcgggcagg ccgaggtggc accgaagcct ggtcaacgtg 360 

gaatggatac gaatgtactg gatacgagat acgaatacga tacagtagag aaagaacgcg 420 

gcgagggtgg cacgaattcg cagacacaac cgagtcggcc tigacaaggcg ccccgcctgt 480 

totgccgccc cttccatcac ccgctttgtc tcattcatcc acggctcctt tttagtgtct 540 

ctgcgcgggt cccaccccct ctcactggac tcgagatgcc gccctgcgct gcctgactcc 600 

acctggcccg gcccgacccg ccccgacccg ttccatggca gatgttgatc gccccgtctc 660 

gcagctcctt ttgtgcaccg cgtggcttcg tactitggcca ttgttgctgt tgctgttgcc 720 

ggtgctctgc tctgtcttcg cgaggcactc ttgaggcgat tttttttgta gtagcgcaag 780 

ctcgttgtgg agccgcgccc agkaaatcat ctaggcttag tctgtatcca ctaccctccg 840 

ctgcgatcac ccctgcttcg ttgtcggcgt ctatttctca ggttcgagtg tttctgagtg 900 

ttggcgagga ttgagtgtag gagcgggagg ggtttgctgt tgtttttgtc gctggcggat 960 

gtcgatcttt cgacgcgatc gcatttttct tttgattgtt ctgttttgga gaacggaatc 1020 

ttttgattgg atatatagat tgtgtgtttt gcatgcgttt agaacgttta cacgggcgat 1080 

gcatgagtcc tggtgtcgtt tggaggccac ggatttagta gtttcttgtg caaggtggct 1140 

tagatcttgt actacgagat gtttctccat gattgtggtg gcgatgactt tgtatacttg 1200 

acgtgtagtt taatggtgat gattcaatta tcagtggtgc atgattttgt tacggatcgg 1260 

atgatcctgg atccctgatg attctttttc aagtaggttt aattctctgc aagcgcgaac 1320 

ggttggtcgt ctcattctaa tggtggcatg atcgcttatt aaattacgtc gactgaattt 1380 
tctccgtctc ctgaattgtt ggagtagcgc ctggaaattt gttagatgga gatttttcca • 1440 

ttatccggga aattattcta ttaattcttt tagactcact cgctcataac gcatattgaa 1500 

ataaaccaca gatgattgct tgatcactta ttcatttgaa tttgacagaa tacttcccct 1560 
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tcctgtttcg gtgaattaaa ttatttcgat 


atttagaatt 


taatttaata 


ttatttttac 


1620 


acagtacaac gaatgcaaag tggaggagtt 


gtcaggacaa 


ctgaatccct 


cagtttttct 


1680 


agtctatatt 


tctgaagact tccacacaat 


atagtagacg 


ttctgtgcta 


tcctgactgc 


1740 


aagacaaaat 


ttacgacgca aagtaacatc 


tcctttttta 


atctgagatc 


tcttcaaatg 


1800 


gttgggcagg 


tccgtattaa gaa 








1823 


<210> 18 
<211> 419 
<212> DNA 

<213> Funaria hygrometrica 










<400> 18 
aggagtgtta 


cacatctttt acttttttca 


gcacgcctct 


tcgctcggct 


tattgaactt 


60 


cgattacaaa 


cttgtgtggg taccgaacta 


ggccggctag 


cgtagatcga 


gtagaggtcc 


120 


ttgttgcagg aagttttcgt ttgtaaaaat 


agctgatatc 


tggacacata 


cgagtggctg 


180 


attggattca 


gtgacattca cattatttgt 


taacaggtcc 


agggttgttc 


gtagagtctg 


240 


gccccatttc 


tcgtcggaat gttggcgccg 


ttttgtgtga 


aatgatggtg 


attatggtta 


300 


aaatgcatgc gtagtcctgt tgactatggc 


tgaatggata 


agatatattt 


ccatcatagg 


360 


ttagatttca 


agcggagcgt gaactgtgac 


gctcaatcac 


agaatgcgtc 


gtcttagcc 


419 


<210> 19 
<211> 1333 
<212> DNA 

<213> Funaria hygrometrica 










<400> 19 
ggatccgaga 


ggaaagagag^ agaagaggga 


gcgactcatc 


tagccaggcc 


cggtccggtc 


60 


ctctgccctg 


cctggcgcga cccgttctcg 


tgcctatctg 


tggttctcta 


tcgctcttgt 


120 


gcctcgccct 


gcacctcctt ttcccattgt 


tgctgctttc 


tgccctgtgc 


tgcttggccg 


180 


ttcgttgtgc 


ccctcacctg tacactctcg 


cagccaagca 


ctgcagtggc 


agttcgcctc 


240 


cgcattcctt 


tcgtggccgc gtatcccccc 


cgtcatcttt 


ttcgtcggtg 


acagttcttt 


300 


gaaggttaga 


gcctctgtcc tgctgccgtt 


ctcgctgtgc 


ttgtgttgtg 


gccgacgatc 


360 


gggtttgttg 


tgcaaggtcg ctgtgcgcat 


cgtcttgttt 


agtattgtat 


gtcgattact 


420 


gtgttgtagg 


agcagtggct aagctttgtc 


cgctgatgtg 


gcacccaacg 


gcgtcgctca 


480 


agtgtaggct 


ttttctttac acgagcttgg 


tccgcgttta 


tggtgtttgg 


atgttacttt 


540 


tttcccgaat 


gacgatatgt tgtgatttct 


ttacaacaag 


agattttgtg 


acgtgaactg 


600 


tagtttgtgg 


attcgaaaag tgttgtttcc 


tcgtttttga 


tggacattac 


ttatgccttt 


660 


tagttgtcac 


ggttggtggc tttgcattct 


tggtcgtcat 


tagtttcatc 


cgatgctgga 


720 
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cattcgctac catcccaagc tgaagtgctg aagttgattt catatgttca gtttgctgtg 780 

tgcaccagta tgagtcaaaa ctgattggat gtcct'tcaca acttcattct cttcatctta 840 

aagtcgagta caaatcaata ggtacaggac tcctatattt tggtgttccg ccat:agt1:at 900 

cgtctttcgt caaaattacc ttattgagag gacttttcct tgcaaaggtc tcatcgagac 960 

caatctctca gagtcagata cctatggtcg cagcagaaat ctctagtcaa tgtttctaag 1020 

ctctcctaag gattttcgct ctttcatcag atgtattcta tccaactcca agttcgcaac 1080 

aatttcttca tacatcattg tcttctggtc tttctgttct gatactgcac cgattcattt 1140 

taggatctta taatccgtgc ttgatgtgcg gatatgtgaa ttccctgagt gttcacctca 1200 

acgtactcaa agttgttcta ctttcagcat ctttcagcca atgcggcaga tgcgatcact 1260 

tccgaggact ttaaaattct gtactgtttc tttaaaacgc ctttttcgat tctatgcagg 1320 

atcattgtaa gcg 1333 

<210> 20 
<211> 3289 
<212> DNA 

<213> Funaria hygroiaetrica 
<400> 20 

atgcatggca aaacaticccc tigtctticcaii gatigagaaag gcgaacctgg actgcttgat 60 

ggtcttccca ggtatctcat tgtgcttcgg tagttgttga cgtcttcact tctgcttctt 120 

tcgcttcctc ttcttcttct tcttcttctt ctttctctct ctctctctct ctctcccaaa 180 

ccttccttct gtcttccttc ctcttatttt cctatgtcaa tgaagtttag cacctcctaa 240 

aatttttgga tgctgttttt taaatagaag ggacgggatc aaaggacgag tgagtgtcgg 300 

cttttgcatt gcttccgttt tataacaacc tattaaggac gtagatcgtg tctgtaaagt 360 

catctcttat agccttttat: agtcttttta agagagaaga gccacctctg agtttcttat 420 

agattcggac aagagatgtg acgacttagg aagtgtcttt cggaattttt cttgtgataa 480 

tggcgttgca tttcttgtcc tgtcttattt ttaackgaac agtatgtacc atttttccgt 540 

atagtcctta ctttataata tgtcctcttt tctttcgcct cacgttcatc atattctttg 600 

atatgtacta ttaactttcg ctatctgttt tcttgtagtc ctttcaccgc gtgccgctat 660 

cacagcttgg tcatagagga ggcctcattt ccagctgacc aactcgagat tacagcatgg 720 

actgaggacg ggcttgtgat gggggttcgt cacaaagtct acaagcacat ccaaggagtg 780 

caatttcatc ctgagagcat ccggactcaa aatgggatgc agatcgttgg aaattttctc 840 

aagattttag atagaaaaga ggcggctgac aaggaaggag ctgaaatgaa aattttggag 900 

agtgtttgag tgatgagttg tactggtata tcttttcttg tgcaagattg ccagcatttg 960 

tcagcttgct tttgttagag tcctgacccc cagcgtataa ctccttgagt atatgcccaa 1020 
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gcaggcctag atgctgctgc aataaccttc tcggtgagac agggtagttt ttgaggtatt 1080 

tttgcacttc cagatggagc tactactaca aatatctatc cttatcttac gttaaactac 1140 

gatggaattg ccatgatcac tcaggtacgt ttaagttgtg attggacttt tagtgattac 1200 

tttcagagcg agctatcaaa ctggtgcttg gaggagcaac gcaaggaatg ctgaattttt 1260 

ctaatgatct aattcagctt aagtttttcg tcaaacttag tgatattttg aagttcatct 1320 

cgttagtgaa acatctcaaa gaagtacgcc attaaattat tgcagggctt gtgatgacat 1380 

tatttgatag tttacctctt aaactgagaa cgcattgctc tcctttgtat agttccagtc 1440 

atttgaaagc tctatttgct ctctgtaact taagccttgt tcaaggcatt taaattccct 1500 

cttccacga-k aaaaatggta gttatgttgc tggttggaac tttcaagata ccataacatt 1560 

gtggttctca ttcacaacgc aggaagtttg ttgacctata ttttitgaaag tggcgagtga 1620 

aattgtttac tcatcacttt atgtgtgttt ctagtatgtc acttcaattc cttcctcaac 1680 

tgtgcctaat ttttcatctc tgtgtgtcac gagcgtaatt tggcttagac gttggaacat 1740 

tctaaggttc cagtaaccag ttttcattta ttatttttaa attcacagcg cctcaagtaa 1800 

tgaaaggaca aacgccgatc attgcgcaac tc1:aat:tgtg acggtcttca agacaactaa 1860 

cggcaggtca ctctcttgtg atgttctcgt tgttgtcaaa cctgtataat ggcaattcat 1920 

ttcgacat:ca cggcaaactc atgatgg^tl: ttaacgtigat titgctcacca cctttcattc 1980 

aaagtta-tca ccgacaccct atgggtttaa ccatgttatc tgaaagcttt ctctacgtat 2040 

gtatgaatct gctcattagg gtgaatttgg aactliaaaga atctcacgcg atgtccatga 2100 

attttgttac tggacaacat atactgttga ccacatagat atgcatgttt agaactgcaa 2160 

aaaagtttgt tcacgaagac agaacgacta gaacgcagaa tacctgcgat cggtggaatg 2220 

■ggatcatttg cagtaaagct agtaaaggat: cgaaatiagac gcagagtaaa cccgatgcgt 2280 

tagaggggaa tgggagatcc acaggactcg gagagaaaat gcaaccctgc gggtaaaaat 2340 

agagaacgcg aggaggaagg gtagccagaa gagtttcacc gggatctaca gtataagccg 2400 

caaagggagc cacgggtact agtgccagct ttgcagcaga gagcgaacgc gagggagcga 2460 

acagatccgg gccccaaatc cccttcttct atctctcaag ccgtccacag ccttcattct 2520 

ccatcctcgc actattctcc tcacagcagt tgcatttgtg gttctctcca tcttcaaccc 2580 

ttcgactttg gtgcaagccc gcttgttatc taticccaagg tttcacgcac tccccccttc 2640 

gctgtgtgtt tcgttgcaat atttttggct ttagttttta ggtttataca tagtgcacat 2700 

gctctcgcaa aaccgtgccg cttcagggga tcgtggttct gtagacttga gcacagagat 2760 

gcgggtgaac tcttagtggt cgccgctgca tccccagagt agttatgcta cctaaagaag 2820 

cgtgctcgta cggtcgatat gtttagagat ggatatttag acgatggtgc gtgtcctgcg 2880 

gtcatcagag taggtgaagg gatttttcgt aagatctgct tttgtgacgg atctgcaatg 2940 
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caggaggtct gcgtctttct ttttcttcag cttcgtgccc aatgcgtcaa atgcgcaccc 3000 

attgcacaga gtgctattaa ggcggcttca tgaagctccc agttttgtga atcatgttaa 3060 

cttgtccact gatcagaacg ttcgggctgg catacgtgaa gcgaatacac atttttctac 3120 

agcatgttcc ttattttagt cttcatactc actgcttcga ttgccggagg gcctccatgt 3180 

tcgaccacat cttcacacgg ggcttatcat ctgacctaaa tcgcacgtgg cctctgtatt 3240 

gtgtcaatgc cagtaacagt ctttttgatg cgcagaacat ttcatctcc 3289 

<210> 21 
<211> 937 
<212> DNA 

<213> Marchantia polymorpha 
<400> 21 

catatgcgta cggagttgtg gtccccgatc gccgtagttg ctgttggtgt ctggtcacag 60 

aggattcttt gcttcgcttc ctaatgtagg tggccagggg tggatcgtct tcctcctacg 120 

cttcgtttgg acacatacat ctggatcttg agaggaacac gtgaattaga gttacatgcg 180 

gtattgcgtc atctttgcga ggtaacggcc gcgccgcaga cctagcggtt gcttctgcgc 240 

gactcaagga atcttccctc tcctgctcca tcactggaat gagagttgca gtctgatctt 300 

tgggaaatct ttcatcttgt tgaccatcga ctctgtcctc tcgatgaggt ctgggatgat 360 

tctgcatgtg atactagcgc agtcttcatg attgtcacat gcatccagat gcgacatctg 420 

gcgcgctttg fcgcttggtca tagccgcctt cttttatctt gatttgccta atgagcccca 480 

tttccagacg tggacggcag atcggtcata aggtccaaga gcaggaaatg ctatgaggcc 540 

gtttgcgtgg tctacctctg ctggcctgcg aaaagactgc ctgtccgact tcaatatctt 600 

taaacattiag gctcttcagt tgtctcgctc agaccattat tatgagttat tgttaccgta 660 

gtgtgttgct atgtcagccc gtgtagtctc gtcaatttct ggagggtaat gcgaacttgt 720 

tcatgacggc acgtatctcg tcgccccgaa gatcaccctt gttgagaagg atttcatgcg 780 

tctgcgtcct cgttcatgtt gacatgaatg atagaagccg ttctgaagac acgaaatgtg 840 

gttgacatat acattgtgat gctcatgtct tttgtcgagt caccaagatc cgcaaccatc 900 

tcatcttctt tcattttggt taggtaactt cgcgaaa 937 

<210> 22 
<211> 3025 
<212> DNA 

<213> Marchantia polymorpha 
<400> 22 

tcatgatgtt aagcgttttc ataatccaaa gaggttttgt atatagataa aatttacttt 60 

ctgaatatgc aagcatcata ttctaaattt aatcgaacat aattttttct gagctttctc 120 
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tttctttttc tttaaattaa atttccttca ctgcaatttt tttattacga ctcacacgag 180 

gagtattttc cgactataga tcttagggta tataactata tatcacgctc gttctaaaca 240 

ttttttctaa ttttatgaaa agagataaat atattaataa tataggttat ttagattatt 300 

gaaattcaca gaaaatacca tttttgtctc attcgatatg ttctagatgt gtgtgcgtat 360 

atggtcatat acttgggata tttttaaatt gtgaatacaa gattataaca aagttatcat 420 

tgcaaaatac taaagataag ttatctttgg tgagaagaca tgatatacca tctgcatatt 480 

acttattcac caattgacca aagatttaca atctaccttg atgaaccata aatttgagaa 540 

ttttatatgc agatatttgc ggat<;t1ttcc aatcattatc tagctcttgt ttacattttt 600- 

gctttLcacaa aaatgcaata atgtgaaagt tgatgcaata atccctttag gtttitttgac 660 

tcataacaat tttctctcca aagcatt:gag attcaatgtg gacgtgatac ataaattcac 720 

atcttgatita gttacatata aatgtggaac tgccgtat-tt gtcggaaagt t:cat:acaaiit 780 

ttttttgttc atttgaagat cataagatag ctgcatatat caccattagt gatgatatga 840 

tatatgacat gagaaaaata taacttaata tgaaggaagt cttgatatgc cttgctatcc 900 

ctaggttggg gtaggtcttt ctttcatttg cgattattat tactgtgagg aatatticggt 960 

agaatggatt ccttggaagt gttgtatttt tgacctctca taattaagca cagatitaatc 1020 

ccttcatttg tggtctatca atcaagtggt ctacgaatga ctctaatttt aagattattrt 1080 

ttgtagttgt gtggtgtttt agtagttacc aatcktatac ttgaaagaaa atgaaagcaa 1140 

tgattactca tactactcaa tgccaagatc ggaggctaaa tccaatgtat acaagtatag 1200 

aaatttgtaa agagttaagc tctttctttg ttcatgtagc tttgaggctt tgtaaaaata 1260 

tggacattga ttcggatata gaggtgagtt gtgcacaaga gatgaccata cttggtgtca 1320 

a^gtgtagca tttttttcag attatttata agaaaataat: caggaaagga aaataagtiag 1380 

tattcatcct agatataaca tt:tgt:cgaga aatciiacgag ataaacattt ttticagacga 1440 

gaacaatitict tcaaattttc agatgcaagg gtacgcattt agcattgcgc -bgatattaga 1500 

gctagtctcc tattgcatgt ttgatttcat acatgtacca cccattcttg ttactgcagt 1560 

gtgtgaaact tgttgaataa gaagttccgc aattatttca aattattgag agtcttctta 1620 

cataattttt acttatccaa aattcttaag aaccccacaa taaattcagt gatacgcttt 1680 

gaatggctca ccagttactg gactgccaca attcgcagca ttggagactt ggccaactca 1740 

accagagaag ggaccacgtc gaacgatcta cctccctccc agtgagtgag tgagtcttcg 1800 

ggtgcagtat tgtccaagtc ctggaatgtc gatccagccg caggaccagg aagatcgggc 1860 

cgggtacagt aaagttgcca taacaatccg gcaacgaacc acagatccgg gacgatctag 1920 

cgggaagttg aagtccaagg ctcggggcac atctccctgg tagaattaga atccatagcc 1980 

agaattctat ctcgaaacct tgtttcgcca gcgttatgat tataatcaag cgtccccgtt 2040 
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aatctgattc ctgtgaaagt tagttagtaa cttcataccc cagcattatg attataatca 2100 

agtgtctcag ttagtctgat tcctgtgaat gttagttagt aagttcaggc cttctcgtaa 2160 

tagcttcttg cgtataatct gaactgttga taatggttaa actcttgaat tacgacatat 2220 

cagtcccggg agattaatct gcttccgcta agctcgagga tgcacagcag taattttggg 2280 

tcgtttggga tttgataaaa cggacgggaa tatgcgtcgc gagttccgag taggagtgag 2340 

gaggaatgca aaccagcgga ccacgtaaag aggcccacga cagtccagca gcccagctgt 2400 

gagacacaag ggggacgaaa gggaccgccc aggccgacca cctgatgtca gggggagctg 24 60 

gtgcgagcgg cgacggacat ggatcggcgt ttggttgcgg tccagaagcg ggcgaggagg 2520 

gatccgcatg agtgacacag tgggggcaga attgggagaa gatcgtgggg gtaattgaga 2580 

ggggagat:tc gggttggggc cgagacaggt aaggaacacc gatgatgctg aggaaaatat 2640 

gaggaaiiticg tgagaatgcg acagggcgag agcactgtgg ggcagaatgg aaggggggcc 2700 

agcgatattc gagcaataaa ataagagcgg gggacattcg aaaagaggcc ccatataaag 2760 

ccgatcttcc attctgtttt cacagagctc ttcgtcgaac agagcctctc aaactcgctt 2820 

tgtgctccca gtgcttctgt ctcggatctg ctctgctcgg cttcgcgctt gttgttcttg 2880 

tgaccatcac cgccttcagg acgctcacgc ccaacgcaag aat:ttcgag1: cgaagtaagc 2940 

gagcagctca atcgcttcgt taacgcgttt gcggagatct tcgaggtttc gcgttcgaag 3000 

ttcttcggac acctccttcg ttaac 3025 

<210> 23 
<211> 909 
<212> DNA 

<213> Marchantia polymo3:pha 
<400> 23 

aagcttagca agcagctctc gcagcggatc tgctcttctg ctgctccctc tgcttcctcg 60 

tgctacacgg tcttcgtcct cgcttcctcc acgcttcctc gcgctctctc caggtactcg 120 

tcgcctcgcg ctctttcttc ttcctagttc gtccgttcct cgtaccggga tagggcggtc 180 

gcgggtctcg tgagggtttt ttcgagcaag gtgcgtgagc aagttcatat cggtgggcaa 240 

tgcatggggc gaacctggtc gggccctttt ccgaggccgc cggagagcct agtctccaag 300 

ctgtagtatc ggtgttctcg aagatcggtc ggtgtctgca tctctccatc tcgattcgtt 360 

tcgtctgagc tgatccgccg gtcgattttg acgatgtcgt gtcctcacct acgcaagttt 420 

ggttccgagg attagttttg aagatgctgt caatgggaag tttagctctt ggttcgtgat 480 

tagtttggac acggtcacat gaatcgtagg gacccaggtg tcgggcggaa tcttcagcag 540 

tcatttcggt ttccgtaacg ctggatttaa gctgaaaacg ttcatcgatg gattgcggat 600 

accatgacct aatggatcgt ccagcttatt cttctggaag tatagacgtg tgatggctgt 660 
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ggcctgtggt agggttggac acgcccgcag 


tggtctctcc gaatttgaat 


gtcgcaatgg 


720 


tcgatgtgct 


ctgccgattt ggggaatcga 


agtggcaaac cggtcgttcg 


gactgtcgag 


780 


tgtatgcctg ctgcttgtgc gatgtagtgt 


ggatttttcc tccgatgttt 


tccaaacgtg 


840 


gtcgggattg cagttcttca atctaccagc 


ggagctaatt tcgtctttgg 


cttgcagtct 


900 


atcgtcgat 








909 


<210> 24 
<211> 2146 
<212> DNA 

<213> Physcomitrella patiens 








<400> 24 
atacaagagt 


tataaatcat: atacaatgat: 


tactttcata taatrtgttga 


atattattgt 


60 


tacaacctaa 


gtaacaataa catitcaatta 


aacattcatit: gtggttttca 


agcatattaa 


120 


tcattctttc 


ttctctaccc tatagtgatg 


ggaaattatc ccaaacticaa 


tgtcatactc 


180 


caggcaattc 


agaaatatag tgagatgaat 


accaggaata tttattcaca 


tcgaccccta 


240 


tcgccgggca 


at:gccactcc caccgcggaa 


1:gagaaact:c cttigaaaaaa 


caagtccctt 


300 


cccagctgcc 


cgaaatcggc cgcctggtca 


gcacggcacg acactgccca 


cgtgcaatcc 


360 


tgacgtggcc 


tctacgtccg gaaggcggcg 


ccgttagcga tgtcctccta 


tgcaagttcc 


420 


tcttgtggcg 


gggcagtgtg cccgccaact 


tcaccgtcac cctccacccc 


aacaagtggc 


480 


ccaaattact 


caggggcagc ccagcttcga 


aattttaagc ggtgaccgcc 


ccttctcatc 


540 


gtcacgcgtt 


acttcttttt cactcaatcg 


agtctgttta ttattggccg 


clzaggaaatt 


600 


gcagcttcca 


actccgcatc accgcgtgca 


gtacagtgga gatcttcaag 


agtgtcctca 


660 


ccaggaattt 


gcaacttgct ccttgcaatt 


iigtaataaat ggacagagaa 


gcctagattc 


720 


cgcaticcaca 


gtgatgggtc acgtatcaat: 


aagcgaagct gcgttiggcaa 


ctatggcaat 


780 


tggtttggtg 


tcttcgttcc tgtcaagttt 


gaaaagaaga gggagatctg 


atttcttaat 


840 


aagtgtcgac 


ttgtctgggt agtggattgc 


gtggggcgtg tcgtagtgcg 


acgcgatcgc 


900 


atcaaattca 


tcgcctcaaa atttgtcacg 


ttgttgggtc aattgcaacg 


aactgcgatt 


960 


gaaggattct 


tctcggtggc cttcaaattt 


gctttagtat gacagaagtt 


ttgcagctgt 


1020 


actcggcgtt 


tggaaggagt ggaagtgagg 


tggatcacca cgcaccggag 


ttggtgaatt 


1080 


gtttactgca 


gaaaaaaatg gctttgatca 


ca'tcagaa'tg attgatgttt 


ea crct t craa t 

v.*dy Vi* fc« act ^ 


1140 

J. *a \J 


ttcacctcaa 


gatgtgttct catcatgaaa 


tttttattgg gccaggatgt 


actttcattg 


1200 


ttttgaaaga 


atattttaag acgcttgtgt 


tttacaacct ttcggaagat 


gcgtccttga 


1260 


ttgaaagtgg 


ttaatgtttt gtacatcatt 


actggatatg aaaataccaa 


taaaatgaaa 


1320 


tacaataaaa 


tatttttttg aaa-bgaaaat 


tggtttaaat aagcatgtaa 


ataatagacg 


1380 
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gtggagtaaa 
agtaagaggt 
ccaaccaata 
gactagtgac 
tatatactca 
cttcattatg 
tggacggaaa 
tactgcccga 
agtgcacggt 
gggcacacgg 
cttgctataa 
aatttgtttc 
gccgttgaat 



gaaaaggtaa 
gtccgtttgc 
cacaccattg 
ttacatatta 
cttgtttatt 
ataattclagt 
tggatggatg 
gtccgataaa 
cgaaggcggc 
tcaggaagaa 
gaacactgct 
gaccctgcct 
ttagtgattg 



taaaaaaaaa 
aagcaataaa 
ttttgctgca 
cttccaaacc 
tttaaaccat 
gacgttttaa 
taaatggaaa 
gacggaaaca 
gtttgtttac 
attgttcatg 
cctccgatct 
tgttgagtgg 
tgtagcgacc 



agtatgaatt 
aattcagtaa 
aagctagggt 
gaagcaaagc 
ctgaaatcac 
tctcattaaa 
gatcggcggc 
atgcgataaa 
atttacttca 
acggtcagaa 
aaacctcgga 
ccgtagaggc 
agtacgtcct 



ctattactct 

ttgctagata 

ttctaaggcc 

aagggtactc 

acaaaaatgt 

tttaatgcca 

aaaaagacca 

agtaaaagtg 

ccaaaaccga 

cattctggat 

ttgtgcgctt 

tcgacagtta 

gtaagg 



tcaatataaa 

aattcaaaag 

acaattcaat 

cacgattgta 

tgtgaccctg 

ccgtaggtta 

aattccatac 

agcagaagaa 

gcaggatatc 

ggttggcgtg 

ctagatactg 

ggatcagtgt 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2146 



<210> 25 
<211> 524 
<212> DNA 

<213> Funaria hygrometrica 
<400> 25 

gaattcattt ccattaacga gaatatgaca gtgggaagag cttccacgtc atccaaactc 
aaagtatccg acgtggtcaa' tccaagtgcc agtgccacct cagctccttc accagtccat 
ctcgcggata agggtgacag caaggcgcgg tattactgga taagagaagc ggccaaggcg 
gcagccactg tggtccactt tgctgcgtca ctacctactg cgattgtaat gacgagcggc 
agcgtcgtgt gacaggcttg aaccgaccgc tgcttcagcc gcaggcagac tagaaaagtt 
tactcgctgt cccactcgtt ttctgggtgt gcatccgaag tttctggatg gttgcccgtc 
gttcaataaa ttgtcgcgcg tcgagctagc ggacactttt gtcaccgttc ttctctgttt 
attctggacc agaggtgctg ttagctttgt tgtgtgtgag * tccttgggga aatccctgcg 
cgtcacgaga gtttattgca gggaagtgat aaagcgttgt gaag 

<210> 26 
<211> 2088 
<212> DNA 

<213> Physcoraitrella patens 
<400> 26 

atgcatgtaa gataattcca attagaatct ataaatttct tattataatt ttttaaaaac 
aaagtaccaa aatattatta ttttaatatc ctctaagtta aatccatata ttaagtagaa 



60 
120 
180 
240 
300 
360 
420 
480 
524 



60 
120 



27 



acaattattc taataaataa tgataaaaat tagacatctt gcaataaaat ttctttttaa 180 

aaatagatac ataacatgaa aaatatccca taaatagcta acaccat'caa aacatttgac 240 

caaatatgca cttttagatg tgtcaagaca aaaagaaata tttgcaagat tttggagtat 300 

ctaaactaat gtttgtcctc tttgcactat gagtaggatt tcttttattt tgtttagtga 360 

aaagatacat tgcaatttgt tttcataata aaaactatac taatgaaata gtgctaaaaa 420 

ataacaagat taaaaaaaca taacccttct tacaacctta aatccttcta attagactac 480 

ctcaaagttg tgccatttag cacaaaaacc attcttttaa atctacttaa ccctccaatt 540 

tccaatgagc ttcatgtgca tacacaagca tgctttcttt ctttctttct tgaagaaaac 600 

ttatctgaac aaacgttaat actctacttg ttgatgaaag t:ggaactttg accacatiaca 660 

ggcttggtga tgtactttgt atatctcctc acagttagtc tggtgcaatc caaccatgca 720 

catagaatat gaaiiggggac atgcttccag ccactcgggt gtgcagaaaa cttgacaagc 780 

gagattcaag caacggcgac tacgacgccg atcacgcaat acaaagcatt gttagtatgt 840 

gataaaccag agaaagagat cgagtatgtg cacacaaaaa cacacagatc cacaggtatt 900 

gtctacggcg ccaccaccat ccgtcaaagc taccatctcg tcgaggaaga atggtatttc 960 

taaaactagc aaliacaaccg ctrgatggaaa caaccgaaag ctatgtcatt ggagagggcg 1020 

cacgagttca tggaatacac agtgagaaga gataaagaaa taaaataata taaaatacaa 1080 

gtgtgcatca gcaagacatg gccgaaatzct: aacaactgtc tgcacatgct gtggtgggtt 1140 

gtatccacgc gctggaggaa gtaactttcc tacatgcaca gaaaaacat;t: ttcagattag 1200 

aaagctcttc tgttctagct aatctctagt; accaagctca gacgtgtagc cgacgaagcc 1260 

aatagcagct gggtatgcta gtcactgatt ctgaagcggc cggtgtgtcg attgcgatgt 1320 

atctcagttc ggcgaaggcc tgtgtctgga acatgggaag agggtcttct tgcactcgtc 1380 

aatctcticac agcaactggg cagggtitglia tccgaacgtg gaaaacgcag caaccgtitgt: 1440 

tgaaccaaag gatggtattt ttctccgaga aaaacgccgt ggcttatctg gtgtagacga 1500 

tccctaatcc ggacatgacc gccgctgtgc aggtgttggg aaaccacaat gcgcaagaga 1560 

tgcgagagat ggaggagtgc aagaagtacg actgcgaagc tacatgcttc atcgagcaat 1620 

gaagtctggg ttttctccaa cttccgcatg cacacacttt tctcgacgac atccgtttca 1680 

aggtacgcat cgggaaactg acgattctct gcactggtgt tcagactctc cggagaggcg 1740 

gtgtcatgtt ctgagctctt tttcgataag gtgctgttga agtccagaat: aatiggggtct 1800 

ggattatcct ctggacggct ccgcttctgg tcgaaaaaat ttcatcccaa aaaaggactt 1860 

atctgttgac tgaaaatgtt taattgtggt gaggattgca tgcagcgacg tcgtaaagat 1920 

agggtgacaa ggagcgttcc agagctcagc tcggggcatg ccccggcact ccctagcata 1980 

taaacatacc gggtggaatt tgtacccacc aggtcttgct cggtgtcccc tgtgcccaag 2040 
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ctgttggctg cattgccctt gcgattcgag tgtggagaga ttttagca 2088 

<210> 27 
<211> 500 
<212> DNA 

<213> Physcomitrella patens 
<400> 27 

ggaacgaatt tgtcgagctc tctggttctg ggtcgggtag cagtagcttt gatggtgagg 60 

cactgacagt cagtcgctca cacggcaaag tagcctggat gtgcttcgca acgaactcti: 120 

gaatttgagt atgtgagttc actttgaaca tcccagaagc aaaagaatgg gttttttcat 180 

gtttgaattt tattttgtat agttgtgttg agccgcgatt tctatctgtc acttggcttg 240 

atattctgag tttctccgat acgaatagcg aagtccactt gaacatctgt aacggcagca 300 

attgcgtcag gtcaatcctc tcagattctt tcggtgcttt tgtcgtaaac tagcttgatt 360 

gttgtccatt aagcttggtt gcttttcgtg agaaagcatg aaacttctat gacgaaaccc 420 

ggttgattgt aatgtaacta gtttgattgt agtttgaatt tggtaattgc gttgtatgat 480 

acataatgaa agtttcatga 500 
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